This study explores the factors influencing metro passengers’ arrival volume in Wuhan, China, and Lagos, Nigeria, by examining weather, time of day, waiting time, travel behavior, arrival patterns, and metro satisfac...This study explores the factors influencing metro passengers’ arrival volume in Wuhan, China, and Lagos, Nigeria, by examining weather, time of day, waiting time, travel behavior, arrival patterns, and metro satisfaction. It addresses a significant research gap in understanding metro passengers’ dynamics across cultural and geographical contexts. It employs questionnaires, field observations, and advanced data analysis techniques like association rule mining and neural network modeling. Key findings include a correlation between rainy weather, shorter waiting times, and higher arrival volumes. Neural network models showed high predictive accuracy, with waiting time, metro satisfaction, and weather being significant factors in Lagos Light Rail Blue Line Metro. In contrast, arrival patterns, weather, and time of day were more influential in Wuhan Metro Line 5. Results suggest that improving metro satisfaction and reducing waiting times could increase arrival volumes in Lagos Metro while adjusting schedules for weather and peak times could optimize flow in Wuhan Metro. These insights are valuable for transportation planning, passenger arrival volume management, and enhancing user experiences, potentially benefiting urban transportation sustainability and development goals.展开更多
The typical model, which involves the measures: support, confidence, and interest, is often adapted to mining association rules. In the model, the related parameters are usually chosen by experience; consequently, th...The typical model, which involves the measures: support, confidence, and interest, is often adapted to mining association rules. In the model, the related parameters are usually chosen by experience; consequently, the number of useful rules is hard to estimate. If the number is too large, we cannot effectively extract the meaningful rules. This paper analyzes the meanings of the parameters and designs a variety of equations between the number of rules and the parameters by using regression method. Finally, we experimentally obtain a preferable regression equation. This paper uses multiple correlation coeficients to test the fitting efiects of the equations and uses significance test to verify whether the coeficients of parameters are significantly zero or not. The regression equation that has a larger multiple correlation coeficient will be chosen as the optimally fitted equation. With the selected optimal equation, we can predict the number of rules under the given parameters and further optimize the choice of the three parameters and determine their ranges of values.展开更多
Extracting objects from legacy systems is a basic step in system's object orientation to improve the maintainability and understandability of the systems. A new object extraction model using association rules and...Extracting objects from legacy systems is a basic step in system's object orientation to improve the maintainability and understandability of the systems. A new object extraction model using association rules and dependence analysis is proposed. In this model data are classified by association rules and the corresponding operations are partitioned by dependence analysis.展开更多
[Objectives]To explore the compatibility rules of neonatal parenteral nutrition(PN)prescriptions based on association rules and hierarchical cluster analysis,thereby providing a reference for standardizing neonatal pa...[Objectives]To explore the compatibility rules of neonatal parenteral nutrition(PN)prescriptions based on association rules and hierarchical cluster analysis,thereby providing a reference for standardizing neonatal parenteral nutrition supportive therapy.[Methods]The data about neonatal PN formulations prepared by the Pharmacy Intravenous Admixture Services(PIVAS)of the Affiliated Hospital of Chengde Medical University from July 2015 to June 2021 were collected.The general information of the prescriptions and the frequency of drug use were analyzed with Excel 2019;the boxplot of drug dosing was drawn using GraphPad 8.0 software;and SPSS Modeler 18.0 and SPSS Statistics 26.0 were used to perform association rules and hierarchical cluster analysis.[Results]A total of 11488 PN prescriptions were collected from 1421 newborns,involving 18 kinds of drugs,which were divided into 11 types of nutrients.Association rules analysis yielded 84 nutrient substance combinations.The combination of fat emulsion-water-soluble vitamins-fat-soluble vitamins-glucose-amino acids had the highest confidence(99.95%).The hierarchical cluster analysis divided nutrients into 5 types.[Conclusions]The prescriptions of PN for newborns were composed of five types of nutrients:amino acids,fat emulsion,glucose,water-soluble vitamins,and fat-soluble vitamins.According to the lack of electrolytes and trace elements,appropriate drugs can be chosen to meet nutritional demands.This study provides reference basis for reasonable selection of drugs for neonatal PN prescriptions and further standardization of PN supportive therapy in newborns.展开更多
The market trends rapidly changed over the last two decades.The primary reason is the newly created opportunities and the increased number of competitors competing to grasp market share using business analysis techniq...The market trends rapidly changed over the last two decades.The primary reason is the newly created opportunities and the increased number of competitors competing to grasp market share using business analysis techniques.Market Basket Analysis has a tangible effect in facilitating current change in the market.Market Basket Analysis is one of the famous fields that deal with Big Data and Data Mining applications.MBA initially uses Association Rule Learning(ARL)as a mean for realization.ARL has a beneficial effect in providing a plenty benefit in analyzing the market data and understanding customers’behavior.An important motive of using such techniques is maximizing the business profit as well as matching the exact customer needs as closely as possible.In this survey paper,we discussed several applications and methods of MBA based on ARL.Also,we reviewed some association rule learning measurements including trust,lift,leverage,and others.Furthermore,we discuss some open issues and future topics in the area of market basket analysis and association rule learning.展开更多
[Objectives]This study was conducted to analyze the medication rules of clinical prescriptions of traditional Chinese medicine decoction pieces for the treatment of novel coronavirus pneumonia(COVID-19)during the epid...[Objectives]This study was conducted to analyze the medication rules of clinical prescriptions of traditional Chinese medicine decoction pieces for the treatment of novel coronavirus pneumonia(COVID-19)during the epidemic in multiple regions based on data mining technology,so as to provide a reference for the treatment of COVID-19 with traditional Chinese medicine.[Methods]The traditional Chinese medicine prescriptions used since the outbreak of COVID-19 in Hubei Province during the fight against the epidemic from February 25,2020 to February 14,2022,the traditional Chinese medicine prescriptions used by Guizhou traditional Chinese medicine expert team aiding Hubei Province,the traditional Chinese medicine prescriptions for rehabilitation and conditioning of patients in Ezhou of Hubei Province after discharge,the traditional Chinese medicine prescriptions for the prevention and treatment of COVID-19 in Guizhou Province,and the traditional Chinese medicine prescriptions for the treatment of COVID-19 collected from the end of 2019 to the present from the Chinese database of CNKI were collected as the data of this study.Excel was used to establish a database and enter it into the TCM inheritance calculation platform V3.5,and the association rules and k-means clustering algorithm were used to analyze the frequency of herbal medicines in prescriptions during the treatment of COVID-19,the frequency of four natures,five flavors,meridian distribution,and drug combinations.[Results]A total of 1859 COVID-19 patients treated with traditional Chinese medicine were included,and the proportion of males was higher than that of females,and middle-aged and elderly people were the most common group.A total of 2170 prescriptions of traditional Chinese medicine were included,involving a total of 383 traditional Chinese medicines.High-frequency medicines included poria,Radix Bupleuri,Radix Scutellariae,Herba Pogostemonis,Fructus Forsythiae,Flos Loniceraeetc.The four natures were mainly concentrated in cold,warm and neutral,and the five flavors were mainly concentrated in bitter,pungent and sweet.The herbal medicines were mainly attributed to the lungs and stomach meridians,and were mainly of heat-clearing,exterior syndrome-relieving and diuresis-promoting and damp-clearing types.A total of 24 high-frequency herbal combinations and 35 association rule were excavated,and 3 types of formulas were obtained by cluster analysis.[Conclusions]The analysis results and medicine combinations obtained in the formulas are consistent with the traditional Chinese medicine treatment theory of COVID-19 caused by wind-heat filth accompanied with damp and toxin.展开更多
A method for mining frequent itemsets by evaluating their probability of supports based on association analysis is presented. This paper obtains the probability of every 1\|itemset by scanning the database, then evalu...A method for mining frequent itemsets by evaluating their probability of supports based on association analysis is presented. This paper obtains the probability of every 1\|itemset by scanning the database, then evaluates the probability of every 2\|itemset, every 3\|itemset, every k \|itemset from the frequent 1\|itemsets and gains all the candidate frequent itemsets. This paper also scans the database for verifying the support of the candidate frequent itemsets. Last, the frequent itemsets are mined. The method reduces a lot of time of scanning database and shortens the computation time of the algorithm.展开更多
Association rule mining methods, as a set of important data mining tools, could be used for mining spatial association rules of spatial data. However, applications of these methods are limited for mining results conta...Association rule mining methods, as a set of important data mining tools, could be used for mining spatial association rules of spatial data. However, applications of these methods are limited for mining results containing large number of redundant rules. In this paper, a new method named Geo-Filtered Association Rules Mining(GFARM) is proposed to effectively eliminate the redundant rules. An application of GFARM is performed as a case study in which association rules are discovered between building land distribution and potential driving factors in Wuhan, China from 1995 to 2015. Ten sets of regular sampling grids with different sizes are used for detecting the influence of multi-scales on GFARM. Results show that the proposed method can filter 50%–70% of redundant rules. GFARM is also successful in discovering spatial association pattern between building land distribution and driving factors.展开更多
Objective:Based on data mining software,applying frequent itemsets,association rules,hierarchical clustering,complex networks and other data mining methods to analyze the usage and compatibility of traditional Chinese...Objective:Based on data mining software,applying frequent itemsets,association rules,hierarchical clustering,complex networks and other data mining methods to analyze the usage and compatibility of traditional Chinese medicine(TCM)patent compound for functional dyspepsia.Method:Use the Chinese patent database to search the compound for the treatment of functional dyspepsia,exclude traditional Chinese medicine extracts,single drugs,combined use of Chinese and Western medicines,etc.,screen the patented compound of TCM,establish an Excel data table,and apply data mining software to The data is subjected to frequency statistics,association rules,cluster analysis and complex network analysis.Result:A total of 238 prescriptions for functional dyspepsia were screened.The four qi of the drugs were mainly warm and calm,the five flavors were mainly sweet and spicy,and the spleen and stomach were the main meridians.The top 10 Chinese medicines with higher frequency are Shanzha、Chenpi、Gancao、Maiya、Jineijin、Fuling、Baizhu、Shenqu、Houpo、Banxia;frequent itemsets show that the drugs are mainly compatible with qi and spleen,qi and digestion;association rules The analysis shows that the common drug pairs used in the treatment of functional dyspepsia include Chenpi-Shanzha、Maiya-Shanzha、Jineijin-Shanzha,etc.;cluster analysis found that there are 4 types of drugs for functional dyspepsia,mainly including drugs for regulating qi-flowing for harmonizing stomach,drugs for soothing liver and promoting Qi,drugs for eliminating food and resolving accumulation,drugs for benefiting qi and strengthening spleen;the 22-flavor Chinese medicine in the core drug network,the core compatibility is mainly to eliminate stagnation and spleen.Conclusion:Data mining research provides a reference for the clinical treatment of functional dyspepsia and the development of TCM formulas;Clinical treatment of functional dyspepsia should grasp the basic principles of strengthening vital energy and eliminating pathogenic factors to benefit qi,strengthen the spleen,and eliminate food.It is a basic treatment method,taking into account the methods of regulating qi-flowing for harmonizing stomach,soothing the liver and relieving depression,relieving dampness and dampness,and combining the specific conditions of patients with syndrome differentiation and treatment.展开更多
These days, health care systems such as pharmacies and drugstores normally produce high volumes of data. Consequently, utilizing data mining methods in health care systems has become a conventional process. In this re...These days, health care systems such as pharmacies and drugstores normally produce high volumes of data. Consequently, utilizing data mining methods in health care systems has become a conventional process. In this research, Apriori algorithm has been applied to perform data mining using the data obtained from the prescriptions ordered within a pharmacy. Ten association rules were achieved from the assigned pharmaceutical drugs in those prescriptions using the aforementioned Apriori algorithm. The accuracy of these rules is also manually studied and reviewed by a physician. Among these association rules, Vitamin D and Calcium pills are the most interrelated medications, and Omeprazole and Metronidazole rankd second in terms of association. The results of this study provide useful feedback information about associations among drugs.展开更多
Analyzing systemically more than 550 Li Dong Y uan’s formula of spleen and stomach by using the association rule to mine the in formation relativity between formula, herbal medicine and syndrome.From this we can stud...Analyzing systemically more than 550 Li Dong Y uan’s formula of spleen and stomach by using the association rule to mine the in formation relativity between formula, herbal medicine and syndrome.From this we can study better the compatibility regulations of the展开更多
Customer requirements analysis is the key step for product variety design of mass customiza-tion(MC). Quality function deployment (QFD) is a widely used management technique for understanding the voice of the customer...Customer requirements analysis is the key step for product variety design of mass customiza-tion(MC). Quality function deployment (QFD) is a widely used management technique for understanding the voice of the customer (VOC), however, QFD depends heavily on human subject judgment during extracting customer requirements and determination of the importance weights of customer requirements. QFD pro-cess and related problems are so complicated that it is not easily used. In this paper, based on a general data structure of product family, generic bill of material (GBOM), association rules analysis was introduced to construct the classification mechanism between customer requirements and product architecture. The new method can map customer requirements to the items of product family architecture respectively, accomplish the mapping process from customer domain to physical domain directly, and decrease mutual process between customer and designer, improve the product design quality, and thus furthest satisfy customer needs. Finally, an example of customer requirements mapping of the elevator cabin was used to illustrate the proposed method.展开更多
HA (hashing array), a new algorithm, for mining frequent itemsets of large database is proposed. It employs a structure hash array, ltemArray ( ) to store the information of database and then uses it instead of da...HA (hashing array), a new algorithm, for mining frequent itemsets of large database is proposed. It employs a structure hash array, ltemArray ( ) to store the information of database and then uses it instead of database in later iteration. By this improvement, only twice scanning of the whole database is necessary, thereby the computational cost can be reduced significantly. To overcome the performance bottleneck of frequent 2-itemsets mining, a modified algorithm of HA, DHA (directaddressing hashing and array) is proposed, which combines HA with direct-addressing hashing technique. The new hybrid algorithm, DHA, not only overcomes the performance bottleneck but also inherits the advantages of HA. Extensive simulations are conducted in this paper to evaluate the performance of the proposed new algorithm, and the results prove the new algorithm is more efficient and reasonable.展开更多
Background:The purpose of this study was to identify the characteristics and principles of acupoints applied for treating chronic hepatitis B infection.Methods:The published clinical studies on acupuncture for the tre...Background:The purpose of this study was to identify the characteristics and principles of acupoints applied for treating chronic hepatitis B infection.Methods:The published clinical studies on acupuncture for the treatment of chronic hepatitis B infection were gathered from various databases,including SinoMed,Chongqing Vip,China National Knowledge Infrastructure,Wanfang,the Cochrane Library,PubMed,Web of Science and Embase.Excel 2019 was utilized to establish a database of acupuncture prescriptions and conduct statistics on the frequency,meridian application,distribution and specific points,as well as SPSS Modeler 18.0 and SPSS Statistics 26.0 to conduct association rule analysis and cluster analysis to investigate the characteristics and patterns of acupoint selection.Results:A total of 42 studies containing 47 acupoints were included,with a total frequency of 286 acupoints.The top five acupoints used were Zusanli(ST36),Ganshu(BL18),Yanglingquan(GB34),Sanyinjiao(SP6)and Taichong(LR3),and the most commonly used meridians was the Bladder Meridian of Foot-Taiyang.The majority of acupuncture points are located in the lower limbs,back,and lumbar regions,with a significant percentage of them being Five-Shu acupoints.The strongest acupoint combination identified was Zusanli(ST36)–Ganshu(BL18),in addition to which 13 association rules and 4 valid clusters were obtained.Conclusion:Zusanli(ST36)–Ganshu(BL18)could be considered a relatively reasonable prescription for treating chronic hepatitis B infection in clinical practice.However,further high-quality studies are needed.展开更多
Mining the data from the huge collection that are present in the database and uncovering the relationships between the item set are one of the key aspects of data mining technologies. Itinerary planning system with pe...Mining the data from the huge collection that are present in the database and uncovering the relationships between the item set are one of the key aspects of data mining technologies. Itinerary planning system with personalization in selecting the places to the users is one of the demanding features in most of the travel plan. In this work, the system is designed in such a way to provide the customized journey plan to the users and also the effective one to the back pack travelers. Here the Points of Interests are the places to visit in each destination for the number of days chosen by the travelers. In this system, the users are allowed to specify the desired POIs to visit for the selected destination and can make their customized travel plan effectively. This proposed system is designed to choose the customized places to visit and to plan travel for K-day itineraries. The most visited itineraries are saved and updated in the database. Association rules are used to find out the frequent places visited in each destination and to provide the reputed places to the users to plan the journey. Here the Weka tool is used to evaluate the performance of the algorithm and the rules that are generated for the given travel dataset. Data set is designed by considering several attributes that can take part during travel such as source, destination, travel cost, budget, etc. Statistical analysis is done to evaluate the performance of the proposed system and the list of features that are present in the system. During the analysis part, registered users, number of logins, frequent visits, and attributes are analyzed. Thus the system can be redefined further with the help of this statistical analysis. It is mostly used at the organization end to evaluate their performance and improve the features. Report is generated once the user has chosen their customized places to visit and all detailed description of journey is presented to the user. Report could be saved at the user end and they can use it for the future reference. Thus the goal of the system is to provide the customized travel with personalization in choosing POIs and to find the frequent places visited with desired amenities.展开更多
To the Editor:Chinese physicians often address the combination of the properties and therapeutic efficacy of Chinese materia medica(CMM).They believe that the properties and therapeutic efficacy of traditional Chinese...To the Editor:Chinese physicians often address the combination of the properties and therapeutic efficacy of Chinese materia medica(CMM).They believe that the properties and therapeutic efficacy of traditional Chinese medicines(TCMs)should be considered as an"organic whole.""Use based on therapeutic efficacy"can lead to omission of the properties of CMM.It also展开更多
基于SQL Server Analysis Services(简称SSAS)提供的Microsoft关联规则挖掘算法和事务数据挖掘功能,通过利用Arc GIS软件、空间数据库引擎Arc SDE和数据库SQL Server软件,提出了一种新的土地地类关系挖掘实现方案。首先结合空间数据挖掘...基于SQL Server Analysis Services(简称SSAS)提供的Microsoft关联规则挖掘算法和事务数据挖掘功能,通过利用Arc GIS软件、空间数据库引擎Arc SDE和数据库SQL Server软件,提出了一种新的土地地类关系挖掘实现方案。首先结合空间数据挖掘(Spatial Data Mining,SDM)相关技术方法,以土地利用数据库为基础,实现空间数据提取;然后通过空间关联操作将空间信息转化为事务,最后在SSAS中创建多维数据集,完成相关数据挖掘任务。基于某市实例土地利用数据库,采用该方法探测相邻地类间的隐含关系,通过建立相邻地类图斑空间关联规则挖掘模型,设置不同的参数,得到了一系列比较实用合理的关联规则,通过实践证明了这种方案的有效性。展开更多
The combined influence of nonlinearity and dilation on slope stability was evaluated using the upper-bound limit analysis theorem.The mechanism of slope collapse was analyzed by dividing it into arbitrary discrete soi...The combined influence of nonlinearity and dilation on slope stability was evaluated using the upper-bound limit analysis theorem.The mechanism of slope collapse was analyzed by dividing it into arbitrary discrete soil blocks with the nonlinear Mohr–Coulomb failure criterion and nonassociated flow rule.The multipoint tangent(multi-tangent) technique was used to analyze the slope stability by linearizing the nonlinear failure criterion.A general expression for the slope safety factor was derived based on the virtual work principle and the strength reduction technique,and the global slope safety factor can be obtained by the optimization method of nonlinear sequential quadratic programming.The results show better agreement with previous research result when the nonlinear failure criterion reduces to a linear failure criterion or the non-associated flow rule reduces to an associated flow rule,which demonstrates the rationality of the presented method.Slope safety factors calculated by the multi-tangent inclined-slices technique were smaller than those obtained by the traditional single-tangent inclined-slices technique.The results show that the multi-tangent inclined-slices technique is a safe and effective method of slope stability limit analysis.The combined effect of nonlinearity and dilation on slope stability was analyzed,and the parameter analysis indicates that nonlinearity and dilation have significant influence on the result of slope stability analysis.展开更多
Based on the least-square minimization a computationally efficient learning algorithm for the Principal Component Analysis(PCA) is derived. The dual learning rate parameters are adaptively introduced to make the propo...Based on the least-square minimization a computationally efficient learning algorithm for the Principal Component Analysis(PCA) is derived. The dual learning rate parameters are adaptively introduced to make the proposed algorithm providing the capability of the fast convergence and high accuracy for extracting all the principal components. It is shown that all the information needed for PCA can be completely represented by the unnormalized weight vector which is updated based only on the corresponding neuron input-output product. The convergence performance of the proposed algorithm is briefly analyzed.The relation between Oja’s rule and the least squares learning rule is also established. Finally, a simulation example is given to illustrate the effectiveness of this algorithm for PCA.展开更多
Objective:To explore the medication rule of Traditional Chinese Medicine(TCM)in the treatment of sleep disorder after stroke by using data mining technology.Methods:A computer search was used to search the electronic ...Objective:To explore the medication rule of Traditional Chinese Medicine(TCM)in the treatment of sleep disorder after stroke by using data mining technology.Methods:A computer search was used to search the electronic database of clinical literature on the treatment of sleep disorders after stroke by TCM from January 2000 to January 2021.Excel was used to establish the database,and the prescription information was described and analyzed statistically.Using IBM SPSS Modeler 18.0 software,Apriori algorithm was used for TCM association analysis,and IBM SPSS 22.0 software was used for systematic cluster analysis of high-frequency TCM.Results:A total of 67 literatures were included,covering 131 traditional Chinese medicines.The medecines with a higher frequency of sodium use include Ziziphi Spinosae Semen(Suanzaoren),Angelicae Sinensis Radix(Danggui),Ligusticum(Chuanxiong),liquorice(Gancao),Poria cocos(Fuling),and so on.From the effect point of view,deficiency-tonifying medicine,sedative medicine and blood-activating and stasis-removing medicine are commonly used.The medicinal properties are mainly cold,mild and warm.The main medicine flavor are sweet and bitter.The medicines mostly belong to the liver,heart and spleen Meridian.Thirty-three association rules were obtained for medicine pairs and medicine groups from the correlation analysis,and the core combinations were"Ziziphi Spinosae Semen(Suanzaoren)-Tuber fleeceflower stem(Yejiaoteng)","Ziziphi Spinosae Semen(Suanzaoren)-Polygala(Yuanzhi)","Ziziphi Spinosae Semen(Suanzaoren)-Cortex albiziae(Hehuanpi)"and"Angelicae Sinensis Radix(Danggui)-Radix bupleuri(Chaihu)-Radix Paeoniae Alba(Baishao)"and so on.Seven medicine aggregation groups were obtained by medicine cluster analysis.Conclusion:In the treatment of sleep disorder after stroke by TCM,the main method is to calm the heart and mind.Meanwhile,according to different syndrome types,the treatment methods of tonifying the heart and spleen,nourishing the liver and kidney,soothing the liver and softening the liver,clearing heat and resolving phlegm,nourishing the blood and promoting blood circulation are selected,which provide certain reference for clinical treatment.展开更多
文摘This study explores the factors influencing metro passengers’ arrival volume in Wuhan, China, and Lagos, Nigeria, by examining weather, time of day, waiting time, travel behavior, arrival patterns, and metro satisfaction. It addresses a significant research gap in understanding metro passengers’ dynamics across cultural and geographical contexts. It employs questionnaires, field observations, and advanced data analysis techniques like association rule mining and neural network modeling. Key findings include a correlation between rainy weather, shorter waiting times, and higher arrival volumes. Neural network models showed high predictive accuracy, with waiting time, metro satisfaction, and weather being significant factors in Lagos Light Rail Blue Line Metro. In contrast, arrival patterns, weather, and time of day were more influential in Wuhan Metro Line 5. Results suggest that improving metro satisfaction and reducing waiting times could increase arrival volumes in Lagos Metro while adjusting schedules for weather and peak times could optimize flow in Wuhan Metro. These insights are valuable for transportation planning, passenger arrival volume management, and enhancing user experiences, potentially benefiting urban transportation sustainability and development goals.
基金supported by the National Natural Science Foundation of China (No. J07240003, No. 60773084, No. 60603023)National Research Fund for the Doctoral Program of Higher Education of China (No. 20070151009)
文摘The typical model, which involves the measures: support, confidence, and interest, is often adapted to mining association rules. In the model, the related parameters are usually chosen by experience; consequently, the number of useful rules is hard to estimate. If the number is too large, we cannot effectively extract the meaningful rules. This paper analyzes the meanings of the parameters and designs a variety of equations between the number of rules and the parameters by using regression method. Finally, we experimentally obtain a preferable regression equation. This paper uses multiple correlation coeficients to test the fitting efiects of the equations and uses significance test to verify whether the coeficients of parameters are significantly zero or not. The regression equation that has a larger multiple correlation coeficient will be chosen as the optimally fitted equation. With the selected optimal equation, we can predict the number of rules under the given parameters and further optimize the choice of the three parameters and determine their ranges of values.
基金Supported in part by the National Natural Science F oundation of China(6 0 0 730 12 )
文摘Extracting objects from legacy systems is a basic step in system's object orientation to improve the maintainability and understandability of the systems. A new object extraction model using association rules and dependence analysis is proposed. In this model data are classified by association rules and the corresponding operations are partitioned by dependence analysis.
基金Supported by Science and Technology Research and Development Project of Chengde City,Hebei Province(201706A043)Young Scholar Program of Hebei Pharmaceutical Association Hospital Pharmaceutical Research Project(2020—Hbsyxhqn0029).
文摘[Objectives]To explore the compatibility rules of neonatal parenteral nutrition(PN)prescriptions based on association rules and hierarchical cluster analysis,thereby providing a reference for standardizing neonatal parenteral nutrition supportive therapy.[Methods]The data about neonatal PN formulations prepared by the Pharmacy Intravenous Admixture Services(PIVAS)of the Affiliated Hospital of Chengde Medical University from July 2015 to June 2021 were collected.The general information of the prescriptions and the frequency of drug use were analyzed with Excel 2019;the boxplot of drug dosing was drawn using GraphPad 8.0 software;and SPSS Modeler 18.0 and SPSS Statistics 26.0 were used to perform association rules and hierarchical cluster analysis.[Results]A total of 11488 PN prescriptions were collected from 1421 newborns,involving 18 kinds of drugs,which were divided into 11 types of nutrients.Association rules analysis yielded 84 nutrient substance combinations.The combination of fat emulsion-water-soluble vitamins-fat-soluble vitamins-glucose-amino acids had the highest confidence(99.95%).The hierarchical cluster analysis divided nutrients into 5 types.[Conclusions]The prescriptions of PN for newborns were composed of five types of nutrients:amino acids,fat emulsion,glucose,water-soluble vitamins,and fat-soluble vitamins.According to the lack of electrolytes and trace elements,appropriate drugs can be chosen to meet nutritional demands.This study provides reference basis for reasonable selection of drugs for neonatal PN prescriptions and further standardization of PN supportive therapy in newborns.
文摘The market trends rapidly changed over the last two decades.The primary reason is the newly created opportunities and the increased number of competitors competing to grasp market share using business analysis techniques.Market Basket Analysis has a tangible effect in facilitating current change in the market.Market Basket Analysis is one of the famous fields that deal with Big Data and Data Mining applications.MBA initially uses Association Rule Learning(ARL)as a mean for realization.ARL has a beneficial effect in providing a plenty benefit in analyzing the market data and understanding customers’behavior.An important motive of using such techniques is maximizing the business profit as well as matching the exact customer needs as closely as possible.In this survey paper,we discussed several applications and methods of MBA based on ARL.Also,we reviewed some association rule learning measurements including trust,lift,leverage,and others.Furthermore,we discuss some open issues and future topics in the area of market basket analysis and association rule learning.
基金Supported by Public Health and Epidemic Prevention and Control Project of Guiyang Bureau of Science and Technology([2022]-4-4-5)Guizhou Provincial Key Discipline of Traditional Chinese Medicine and Ethnic Medicine:Clinical Traditional Chinese Medicine(QZYYZDXK(JS)-2023-04).
文摘[Objectives]This study was conducted to analyze the medication rules of clinical prescriptions of traditional Chinese medicine decoction pieces for the treatment of novel coronavirus pneumonia(COVID-19)during the epidemic in multiple regions based on data mining technology,so as to provide a reference for the treatment of COVID-19 with traditional Chinese medicine.[Methods]The traditional Chinese medicine prescriptions used since the outbreak of COVID-19 in Hubei Province during the fight against the epidemic from February 25,2020 to February 14,2022,the traditional Chinese medicine prescriptions used by Guizhou traditional Chinese medicine expert team aiding Hubei Province,the traditional Chinese medicine prescriptions for rehabilitation and conditioning of patients in Ezhou of Hubei Province after discharge,the traditional Chinese medicine prescriptions for the prevention and treatment of COVID-19 in Guizhou Province,and the traditional Chinese medicine prescriptions for the treatment of COVID-19 collected from the end of 2019 to the present from the Chinese database of CNKI were collected as the data of this study.Excel was used to establish a database and enter it into the TCM inheritance calculation platform V3.5,and the association rules and k-means clustering algorithm were used to analyze the frequency of herbal medicines in prescriptions during the treatment of COVID-19,the frequency of four natures,five flavors,meridian distribution,and drug combinations.[Results]A total of 1859 COVID-19 patients treated with traditional Chinese medicine were included,and the proportion of males was higher than that of females,and middle-aged and elderly people were the most common group.A total of 2170 prescriptions of traditional Chinese medicine were included,involving a total of 383 traditional Chinese medicines.High-frequency medicines included poria,Radix Bupleuri,Radix Scutellariae,Herba Pogostemonis,Fructus Forsythiae,Flos Loniceraeetc.The four natures were mainly concentrated in cold,warm and neutral,and the five flavors were mainly concentrated in bitter,pungent and sweet.The herbal medicines were mainly attributed to the lungs and stomach meridians,and were mainly of heat-clearing,exterior syndrome-relieving and diuresis-promoting and damp-clearing types.A total of 24 high-frequency herbal combinations and 35 association rule were excavated,and 3 types of formulas were obtained by cluster analysis.[Conclusions]The analysis results and medicine combinations obtained in the formulas are consistent with the traditional Chinese medicine treatment theory of COVID-19 caused by wind-heat filth accompanied with damp and toxin.
文摘A method for mining frequent itemsets by evaluating their probability of supports based on association analysis is presented. This paper obtains the probability of every 1\|itemset by scanning the database, then evaluates the probability of every 2\|itemset, every 3\|itemset, every k \|itemset from the frequent 1\|itemsets and gains all the candidate frequent itemsets. This paper also scans the database for verifying the support of the candidate frequent itemsets. Last, the frequent itemsets are mined. The method reduces a lot of time of scanning database and shortens the computation time of the algorithm.
基金Under the auspices of Special Fund of Ministry of Land and Resources of China in Public Interest(No.201511001)
文摘Association rule mining methods, as a set of important data mining tools, could be used for mining spatial association rules of spatial data. However, applications of these methods are limited for mining results containing large number of redundant rules. In this paper, a new method named Geo-Filtered Association Rules Mining(GFARM) is proposed to effectively eliminate the redundant rules. An application of GFARM is performed as a case study in which association rules are discovered between building land distribution and potential driving factors in Wuhan, China from 1995 to 2015. Ten sets of regular sampling grids with different sizes are used for detecting the influence of multi-scales on GFARM. Results show that the proposed method can filter 50%–70% of redundant rules. GFARM is also successful in discovering spatial association pattern between building land distribution and driving factors.
基金Capital project for application and promotion of clinical researches(No.Z171100001017123)Capital specialized scientific research proect of health development for young excellent talents(No.2018-4-4078)。
文摘Objective:Based on data mining software,applying frequent itemsets,association rules,hierarchical clustering,complex networks and other data mining methods to analyze the usage and compatibility of traditional Chinese medicine(TCM)patent compound for functional dyspepsia.Method:Use the Chinese patent database to search the compound for the treatment of functional dyspepsia,exclude traditional Chinese medicine extracts,single drugs,combined use of Chinese and Western medicines,etc.,screen the patented compound of TCM,establish an Excel data table,and apply data mining software to The data is subjected to frequency statistics,association rules,cluster analysis and complex network analysis.Result:A total of 238 prescriptions for functional dyspepsia were screened.The four qi of the drugs were mainly warm and calm,the five flavors were mainly sweet and spicy,and the spleen and stomach were the main meridians.The top 10 Chinese medicines with higher frequency are Shanzha、Chenpi、Gancao、Maiya、Jineijin、Fuling、Baizhu、Shenqu、Houpo、Banxia;frequent itemsets show that the drugs are mainly compatible with qi and spleen,qi and digestion;association rules The analysis shows that the common drug pairs used in the treatment of functional dyspepsia include Chenpi-Shanzha、Maiya-Shanzha、Jineijin-Shanzha,etc.;cluster analysis found that there are 4 types of drugs for functional dyspepsia,mainly including drugs for regulating qi-flowing for harmonizing stomach,drugs for soothing liver and promoting Qi,drugs for eliminating food and resolving accumulation,drugs for benefiting qi and strengthening spleen;the 22-flavor Chinese medicine in the core drug network,the core compatibility is mainly to eliminate stagnation and spleen.Conclusion:Data mining research provides a reference for the clinical treatment of functional dyspepsia and the development of TCM formulas;Clinical treatment of functional dyspepsia should grasp the basic principles of strengthening vital energy and eliminating pathogenic factors to benefit qi,strengthen the spleen,and eliminate food.It is a basic treatment method,taking into account the methods of regulating qi-flowing for harmonizing stomach,soothing the liver and relieving depression,relieving dampness and dampness,and combining the specific conditions of patients with syndrome differentiation and treatment.
文摘These days, health care systems such as pharmacies and drugstores normally produce high volumes of data. Consequently, utilizing data mining methods in health care systems has become a conventional process. In this research, Apriori algorithm has been applied to perform data mining using the data obtained from the prescriptions ordered within a pharmacy. Ten association rules were achieved from the assigned pharmaceutical drugs in those prescriptions using the aforementioned Apriori algorithm. The accuracy of these rules is also manually studied and reviewed by a physician. Among these association rules, Vitamin D and Calcium pills are the most interrelated medications, and Omeprazole and Metronidazole rankd second in terms of association. The results of this study provide useful feedback information about associations among drugs.
文摘Analyzing systemically more than 550 Li Dong Y uan’s formula of spleen and stomach by using the association rule to mine the in formation relativity between formula, herbal medicine and syndrome.From this we can study better the compatibility regulations of the
基金the National Natural Science Founda-tion of China (No. 70471022)the NSFC / Hong KongResearch Grant Council (No. 70418013)
文摘Customer requirements analysis is the key step for product variety design of mass customiza-tion(MC). Quality function deployment (QFD) is a widely used management technique for understanding the voice of the customer (VOC), however, QFD depends heavily on human subject judgment during extracting customer requirements and determination of the importance weights of customer requirements. QFD pro-cess and related problems are so complicated that it is not easily used. In this paper, based on a general data structure of product family, generic bill of material (GBOM), association rules analysis was introduced to construct the classification mechanism between customer requirements and product architecture. The new method can map customer requirements to the items of product family architecture respectively, accomplish the mapping process from customer domain to physical domain directly, and decrease mutual process between customer and designer, improve the product design quality, and thus furthest satisfy customer needs. Finally, an example of customer requirements mapping of the elevator cabin was used to illustrate the proposed method.
文摘HA (hashing array), a new algorithm, for mining frequent itemsets of large database is proposed. It employs a structure hash array, ltemArray ( ) to store the information of database and then uses it instead of database in later iteration. By this improvement, only twice scanning of the whole database is necessary, thereby the computational cost can be reduced significantly. To overcome the performance bottleneck of frequent 2-itemsets mining, a modified algorithm of HA, DHA (directaddressing hashing and array) is proposed, which combines HA with direct-addressing hashing technique. The new hybrid algorithm, DHA, not only overcomes the performance bottleneck but also inherits the advantages of HA. Extensive simulations are conducted in this paper to evaluate the performance of the proposed new algorithm, and the results prove the new algorithm is more efficient and reasonable.
基金supported by Chongqing Municipal Health and Family Planning Commission and Chongqing Municipal Science and Technology Commission Jointly Funded Key Research Projects in Traditional Chinese Medicine(ZY201801007).
文摘Background:The purpose of this study was to identify the characteristics and principles of acupoints applied for treating chronic hepatitis B infection.Methods:The published clinical studies on acupuncture for the treatment of chronic hepatitis B infection were gathered from various databases,including SinoMed,Chongqing Vip,China National Knowledge Infrastructure,Wanfang,the Cochrane Library,PubMed,Web of Science and Embase.Excel 2019 was utilized to establish a database of acupuncture prescriptions and conduct statistics on the frequency,meridian application,distribution and specific points,as well as SPSS Modeler 18.0 and SPSS Statistics 26.0 to conduct association rule analysis and cluster analysis to investigate the characteristics and patterns of acupoint selection.Results:A total of 42 studies containing 47 acupoints were included,with a total frequency of 286 acupoints.The top five acupoints used were Zusanli(ST36),Ganshu(BL18),Yanglingquan(GB34),Sanyinjiao(SP6)and Taichong(LR3),and the most commonly used meridians was the Bladder Meridian of Foot-Taiyang.The majority of acupuncture points are located in the lower limbs,back,and lumbar regions,with a significant percentage of them being Five-Shu acupoints.The strongest acupoint combination identified was Zusanli(ST36)–Ganshu(BL18),in addition to which 13 association rules and 4 valid clusters were obtained.Conclusion:Zusanli(ST36)–Ganshu(BL18)could be considered a relatively reasonable prescription for treating chronic hepatitis B infection in clinical practice.However,further high-quality studies are needed.
文摘Mining the data from the huge collection that are present in the database and uncovering the relationships between the item set are one of the key aspects of data mining technologies. Itinerary planning system with personalization in selecting the places to the users is one of the demanding features in most of the travel plan. In this work, the system is designed in such a way to provide the customized journey plan to the users and also the effective one to the back pack travelers. Here the Points of Interests are the places to visit in each destination for the number of days chosen by the travelers. In this system, the users are allowed to specify the desired POIs to visit for the selected destination and can make their customized travel plan effectively. This proposed system is designed to choose the customized places to visit and to plan travel for K-day itineraries. The most visited itineraries are saved and updated in the database. Association rules are used to find out the frequent places visited in each destination and to provide the reputed places to the users to plan the journey. Here the Weka tool is used to evaluate the performance of the algorithm and the rules that are generated for the given travel dataset. Data set is designed by considering several attributes that can take part during travel such as source, destination, travel cost, budget, etc. Statistical analysis is done to evaluate the performance of the proposed system and the list of features that are present in the system. During the analysis part, registered users, number of logins, frequent visits, and attributes are analyzed. Thus the system can be redefined further with the help of this statistical analysis. It is mostly used at the organization end to evaluate their performance and improve the features. Report is generated once the user has chosen their customized places to visit and all detailed description of journey is presented to the user. Report could be saved at the user end and they can use it for the future reference. Thus the goal of the system is to provide the customized travel with personalization in choosing POIs and to find the frequent places visited with desired amenities.
基金a grant from the National Natural Science Foundation of China (No.81660727).
文摘To the Editor:Chinese physicians often address the combination of the properties and therapeutic efficacy of Chinese materia medica(CMM).They believe that the properties and therapeutic efficacy of traditional Chinese medicines(TCMs)should be considered as an"organic whole.""Use based on therapeutic efficacy"can lead to omission of the properties of CMM.It also
文摘基于SQL Server Analysis Services(简称SSAS)提供的Microsoft关联规则挖掘算法和事务数据挖掘功能,通过利用Arc GIS软件、空间数据库引擎Arc SDE和数据库SQL Server软件,提出了一种新的土地地类关系挖掘实现方案。首先结合空间数据挖掘(Spatial Data Mining,SDM)相关技术方法,以土地利用数据库为基础,实现空间数据提取;然后通过空间关联操作将空间信息转化为事务,最后在SSAS中创建多维数据集,完成相关数据挖掘任务。基于某市实例土地利用数据库,采用该方法探测相邻地类间的隐含关系,通过建立相邻地类图斑空间关联规则挖掘模型,设置不同的参数,得到了一系列比较实用合理的关联规则,通过实践证明了这种方案的有效性。
基金Projects(51208522,51478477)supported by the National Natural Science Foundation of ChinaProject(2012122033)supported by the Guizhou Provincial Department of Transportation Foundation,ChinaProject(CX2015B049)supported by the Scientific Research Innovation Project of Hunan Province,China
文摘The combined influence of nonlinearity and dilation on slope stability was evaluated using the upper-bound limit analysis theorem.The mechanism of slope collapse was analyzed by dividing it into arbitrary discrete soil blocks with the nonlinear Mohr–Coulomb failure criterion and nonassociated flow rule.The multipoint tangent(multi-tangent) technique was used to analyze the slope stability by linearizing the nonlinear failure criterion.A general expression for the slope safety factor was derived based on the virtual work principle and the strength reduction technique,and the global slope safety factor can be obtained by the optimization method of nonlinear sequential quadratic programming.The results show better agreement with previous research result when the nonlinear failure criterion reduces to a linear failure criterion or the non-associated flow rule reduces to an associated flow rule,which demonstrates the rationality of the presented method.Slope safety factors calculated by the multi-tangent inclined-slices technique were smaller than those obtained by the traditional single-tangent inclined-slices technique.The results show that the multi-tangent inclined-slices technique is a safe and effective method of slope stability limit analysis.The combined effect of nonlinearity and dilation on slope stability was analyzed,and the parameter analysis indicates that nonlinearity and dilation have significant influence on the result of slope stability analysis.
基金Supported by the National Natural Science Foundation of Chinathe Science foundation of Guangxi Educational Administration
文摘Based on the least-square minimization a computationally efficient learning algorithm for the Principal Component Analysis(PCA) is derived. The dual learning rate parameters are adaptively introduced to make the proposed algorithm providing the capability of the fast convergence and high accuracy for extracting all the principal components. It is shown that all the information needed for PCA can be completely represented by the unnormalized weight vector which is updated based only on the corresponding neuron input-output product. The convergence performance of the proposed algorithm is briefly analyzed.The relation between Oja’s rule and the least squares learning rule is also established. Finally, a simulation example is given to illustrate the effectiveness of this algorithm for PCA.
基金Beijing Science and Technology Program(No.Z191100006619065)National Key R&D Program(No.2017YFC1700101)。
文摘Objective:To explore the medication rule of Traditional Chinese Medicine(TCM)in the treatment of sleep disorder after stroke by using data mining technology.Methods:A computer search was used to search the electronic database of clinical literature on the treatment of sleep disorders after stroke by TCM from January 2000 to January 2021.Excel was used to establish the database,and the prescription information was described and analyzed statistically.Using IBM SPSS Modeler 18.0 software,Apriori algorithm was used for TCM association analysis,and IBM SPSS 22.0 software was used for systematic cluster analysis of high-frequency TCM.Results:A total of 67 literatures were included,covering 131 traditional Chinese medicines.The medecines with a higher frequency of sodium use include Ziziphi Spinosae Semen(Suanzaoren),Angelicae Sinensis Radix(Danggui),Ligusticum(Chuanxiong),liquorice(Gancao),Poria cocos(Fuling),and so on.From the effect point of view,deficiency-tonifying medicine,sedative medicine and blood-activating and stasis-removing medicine are commonly used.The medicinal properties are mainly cold,mild and warm.The main medicine flavor are sweet and bitter.The medicines mostly belong to the liver,heart and spleen Meridian.Thirty-three association rules were obtained for medicine pairs and medicine groups from the correlation analysis,and the core combinations were"Ziziphi Spinosae Semen(Suanzaoren)-Tuber fleeceflower stem(Yejiaoteng)","Ziziphi Spinosae Semen(Suanzaoren)-Polygala(Yuanzhi)","Ziziphi Spinosae Semen(Suanzaoren)-Cortex albiziae(Hehuanpi)"and"Angelicae Sinensis Radix(Danggui)-Radix bupleuri(Chaihu)-Radix Paeoniae Alba(Baishao)"and so on.Seven medicine aggregation groups were obtained by medicine cluster analysis.Conclusion:In the treatment of sleep disorder after stroke by TCM,the main method is to calm the heart and mind.Meanwhile,according to different syndrome types,the treatment methods of tonifying the heart and spleen,nourishing the liver and kidney,soothing the liver and softening the liver,clearing heat and resolving phlegm,nourishing the blood and promoting blood circulation are selected,which provide certain reference for clinical treatment.