It is of great significance to improve the efficiency of railway production and operation by realizing the fault knowledge association through the efficient data mining algorithm.However,high utility quantitative freq...It is of great significance to improve the efficiency of railway production and operation by realizing the fault knowledge association through the efficient data mining algorithm.However,high utility quantitative frequent pattern mining algorithms in the field of data mining still suffer from the problems of low time-memory performance and are not easy to scale up.In the context of such needs,we propose a related degree-based frequent pattern mining algorithm,named Related High Utility Quantitative Item set Mining(RHUQI-Miner),to enable the effective mining of railway fault data.The algorithm constructs the item-related degree structure of fault data and gives a pruning optimization strategy to find frequent patterns with higher related degrees,reducing redundancy and invalid frequent patterns.Subsequently,it uses the fixed pattern length strategy to modify the utility information of the item in the mining process so that the algorithm can control the length of the output frequent pattern according to the actual data situation and further improve the performance and practicability of the algorithm.The experimental results on the real fault dataset show that RHUQI-Miner can effectively reduce the time and memory consumption in the mining process,thus providing data support for differentiated and precise maintenance strategies.展开更多
Association rule mining is an important issue in data mining. The paper proposed an binary system based method to generate candidate frequent itemsets and corresponding supporting counts efficiently, which needs only ...Association rule mining is an important issue in data mining. The paper proposed an binary system based method to generate candidate frequent itemsets and corresponding supporting counts efficiently, which needs only some operations such as "and", "or" and "xor". Applying this idea in the existed distributed association rule mining al gorithm FDM, the improved algorithm BFDM is proposed. The theoretical analysis and experiment testify that BFDM is effective and efficient.展开更多
In order to improve the utilization of the residential electricity consumption data which contains the information on the user’s electricity consumption habits, a residential electricity consumption behaviors mining ...In order to improve the utilization of the residential electricity consumption data which contains the information on the user’s electricity consumption habits, a residential electricity consumption behaviors mining algorithm model is constructed. Firstly, according to the attribute, the collected data can be divided into the global data and the phase data, then the appropriate global variables are selected to mine the user’s electricity consumption patterns in the near future on the system clustering algorithm. Based on the theory of grey relational analysis, combing phase data with the power modes to analyze the potential characteristics of residential electricity consumption behaviors deeply that verify the ability of latest power mode to predict household electricity consumption situation in the coming few days and the effect of dominant phase variables on the peak load shifting. Finally, from the actual data of a certain family, the proposed data mining algorithm is testified that it can effectively explore the electricity consumption behavior habits and characteristics of the family.展开更多
To overcome the subjectivity of experts in the process of risk response scheme selection, according to the theory of group decision making, a selection method and flow of the risk response schemes for a mining project...To overcome the subjectivity of experts in the process of risk response scheme selection, according to the theory of group decision making, a selection method and flow of the risk response schemes for a mining project was proposed based on fuzzy preference relation and consistency induced ordered weighted averaging (C-IOWA) operator,which can overcome the loss of information in the process of group decision making to a great degree, and improve its efficiency and quality.A numeric example was introduced to illustrate the application of the method, also validating the method as scientific and practicable.展开更多
The subject investigated the system of people-machine-environment in coal mines.The coal mines working process was researched and the theory of grey system was applied to analyze coal mines safety accidents and those ...The subject investigated the system of people-machine-environment in coal mines.The coal mines working process was researched and the theory of grey system was applied to analyze coal mines safety accidents and those relevant factors.This re- search reveals that this analysis method is easy and highly available and the result is of great credibility,which can not only provide theoretical supports to the quantitative study of coal mines safety accident,but offer basis for coal mines companies' safety management.展开更多
As data mining more and more popular applied in computer system,the quality as-surance test of its software would be get more and more attention.However,because of the ex-istence of the 'oracle' problem,the tr...As data mining more and more popular applied in computer system,the quality as-surance test of its software would be get more and more attention.However,because of the ex-istence of the 'oracle' problem,the traditional test method is not ease fit for the application program in the field of the data mining.In this paper,based on metamorphic testing,a software testing method is proposed in the field of the data mining,makes an association rules algorithm as the specific case,and constructs the metamorphic relation on the algorithm.Experiences show that the method can achieve the testing target and is feasible to apply to other domain.展开更多
One of the leading cancers for both genders worldwide is lung cancer.The occurrence of lung cancer has fully augmented since the early 19th century.In this manuscript,we have discussed various data mining techniques t...One of the leading cancers for both genders worldwide is lung cancer.The occurrence of lung cancer has fully augmented since the early 19th century.In this manuscript,we have discussed various data mining techniques that have been employed for cancer diagnosis.Exposure to air pollution has been related to various adverse health effects.This work is subject to analysis of various air pollutants and associated health hazards and intends to evaluate the impact of air pollution caused by lung cancer.We have introduced data mining in lung cancer to air pollution,and our approach includes preprocessing,data mining,testing and evaluation,and knowledge discovery.Initially,we will eradicate the noise and irrelevant data,and following that,we will join the multiple informed sources into a common source.From that source,we will designate the information relevant to our investigation to be regained from that assortment.Following that,we will convert the designated data into a suitable mining process.The patterns are abstracted by utilizing a relational suggestion rule mining process.These patterns have revealed information,and this information is categorized with the help of an Auto Associative Neural Network classification method(AANN).The proposed method is compared with the existing method in various factors.In conclusion,the projected Auto associative neural network and relational suggestion rule mining methods accomplish a high accuracy status.展开更多
目的采用数据挖掘技术探究穴位刺激治疗癌因性疲乏的选穴配穴规律。方法检索中国知识资源总库(CNKI)、万方数据知识服务平台(Wanfang Data)、中文科技期刊数据库(VIP)、中国生物医学文献服务系统(SinoMed)、PubMed、Web of Science、Emb...目的采用数据挖掘技术探究穴位刺激治疗癌因性疲乏的选穴配穴规律。方法检索中国知识资源总库(CNKI)、万方数据知识服务平台(Wanfang Data)、中文科技期刊数据库(VIP)、中国生物医学文献服务系统(SinoMed)、PubMed、Web of Science、Embase建库至2023年10月13日收录的穴位刺激治疗癌因性疲乏的临床随机对照试验文献,采用Excel2021和R4.3.1对处方腧穴进行频次分析、关联规则分析和聚类分析。结果共纳入文献187篇,包含214条腧穴处方,涉及102个腧穴,使用总频次为1044。使用频次前3位腧穴为足三里、气海、关元,前3位经络为任脉、足阳明胃经和足太阴脾经;腧穴多分布在下肢部和胸腹部,交会穴为使用频次最高的特定穴,艾灸是最常用的干预措施。关联规则分析显示,足三里-气海-关元-三阴交为核心腧穴组合。对肺癌、乳腺癌、结直肠癌、妇科肿瘤和胃癌选穴分析显示,补虚要穴最常选用。聚类分析得到5组腧穴有效聚类群。结论穴位刺激治疗癌因性疲乏选穴以益气养血、扶正祛邪为主要原则,重视特定穴的使用,并根据脏腑经络辨证运用多种配穴方法以提高临床疗效。展开更多
基金supported by the Research on Key Technologies and Typical Applications of Big Data in Railway Production and Operation(P2023S006)the Fundamental Research Funds for the Central Universities(2022JBZY023).
文摘It is of great significance to improve the efficiency of railway production and operation by realizing the fault knowledge association through the efficient data mining algorithm.However,high utility quantitative frequent pattern mining algorithms in the field of data mining still suffer from the problems of low time-memory performance and are not easy to scale up.In the context of such needs,we propose a related degree-based frequent pattern mining algorithm,named Related High Utility Quantitative Item set Mining(RHUQI-Miner),to enable the effective mining of railway fault data.The algorithm constructs the item-related degree structure of fault data and gives a pruning optimization strategy to find frequent patterns with higher related degrees,reducing redundancy and invalid frequent patterns.Subsequently,it uses the fixed pattern length strategy to modify the utility information of the item in the mining process so that the algorithm can control the length of the output frequent pattern according to the actual data situation and further improve the performance and practicability of the algorithm.The experimental results on the real fault dataset show that RHUQI-Miner can effectively reduce the time and memory consumption in the mining process,thus providing data support for differentiated and precise maintenance strategies.
基金Supported by the National Natural Science Foun-dation of China (70371015)
文摘Association rule mining is an important issue in data mining. The paper proposed an binary system based method to generate candidate frequent itemsets and corresponding supporting counts efficiently, which needs only some operations such as "and", "or" and "xor". Applying this idea in the existed distributed association rule mining al gorithm FDM, the improved algorithm BFDM is proposed. The theoretical analysis and experiment testify that BFDM is effective and efficient.
文摘In order to improve the utilization of the residential electricity consumption data which contains the information on the user’s electricity consumption habits, a residential electricity consumption behaviors mining algorithm model is constructed. Firstly, according to the attribute, the collected data can be divided into the global data and the phase data, then the appropriate global variables are selected to mine the user’s electricity consumption patterns in the near future on the system clustering algorithm. Based on the theory of grey relational analysis, combing phase data with the power modes to analyze the potential characteristics of residential electricity consumption behaviors deeply that verify the ability of latest power mode to predict household electricity consumption situation in the coming few days and the effect of dominant phase variables on the peak load shifting. Finally, from the actual data of a certain family, the proposed data mining algorithm is testified that it can effectively explore the electricity consumption behavior habits and characteristics of the family.
文摘To overcome the subjectivity of experts in the process of risk response scheme selection, according to the theory of group decision making, a selection method and flow of the risk response schemes for a mining project was proposed based on fuzzy preference relation and consistency induced ordered weighted averaging (C-IOWA) operator,which can overcome the loss of information in the process of group decision making to a great degree, and improve its efficiency and quality.A numeric example was introduced to illustrate the application of the method, also validating the method as scientific and practicable.
文摘The subject investigated the system of people-machine-environment in coal mines.The coal mines working process was researched and the theory of grey system was applied to analyze coal mines safety accidents and those relevant factors.This re- search reveals that this analysis method is easy and highly available and the result is of great credibility,which can not only provide theoretical supports to the quantitative study of coal mines safety accident,but offer basis for coal mines companies' safety management.
文摘As data mining more and more popular applied in computer system,the quality as-surance test of its software would be get more and more attention.However,because of the ex-istence of the 'oracle' problem,the traditional test method is not ease fit for the application program in the field of the data mining.In this paper,based on metamorphic testing,a software testing method is proposed in the field of the data mining,makes an association rules algorithm as the specific case,and constructs the metamorphic relation on the algorithm.Experiences show that the method can achieve the testing target and is feasible to apply to other domain.
基金support from Taif University Researchers supporting Project Number(TURSP-2020/215),Taif University,Taif,Saudi Arabia.
文摘One of the leading cancers for both genders worldwide is lung cancer.The occurrence of lung cancer has fully augmented since the early 19th century.In this manuscript,we have discussed various data mining techniques that have been employed for cancer diagnosis.Exposure to air pollution has been related to various adverse health effects.This work is subject to analysis of various air pollutants and associated health hazards and intends to evaluate the impact of air pollution caused by lung cancer.We have introduced data mining in lung cancer to air pollution,and our approach includes preprocessing,data mining,testing and evaluation,and knowledge discovery.Initially,we will eradicate the noise and irrelevant data,and following that,we will join the multiple informed sources into a common source.From that source,we will designate the information relevant to our investigation to be regained from that assortment.Following that,we will convert the designated data into a suitable mining process.The patterns are abstracted by utilizing a relational suggestion rule mining process.These patterns have revealed information,and this information is categorized with the help of an Auto Associative Neural Network classification method(AANN).The proposed method is compared with the existing method in various factors.In conclusion,the projected Auto associative neural network and relational suggestion rule mining methods accomplish a high accuracy status.
文摘目的采用数据挖掘技术探究穴位刺激治疗癌因性疲乏的选穴配穴规律。方法检索中国知识资源总库(CNKI)、万方数据知识服务平台(Wanfang Data)、中文科技期刊数据库(VIP)、中国生物医学文献服务系统(SinoMed)、PubMed、Web of Science、Embase建库至2023年10月13日收录的穴位刺激治疗癌因性疲乏的临床随机对照试验文献,采用Excel2021和R4.3.1对处方腧穴进行频次分析、关联规则分析和聚类分析。结果共纳入文献187篇,包含214条腧穴处方,涉及102个腧穴,使用总频次为1044。使用频次前3位腧穴为足三里、气海、关元,前3位经络为任脉、足阳明胃经和足太阴脾经;腧穴多分布在下肢部和胸腹部,交会穴为使用频次最高的特定穴,艾灸是最常用的干预措施。关联规则分析显示,足三里-气海-关元-三阴交为核心腧穴组合。对肺癌、乳腺癌、结直肠癌、妇科肿瘤和胃癌选穴分析显示,补虚要穴最常选用。聚类分析得到5组腧穴有效聚类群。结论穴位刺激治疗癌因性疲乏选穴以益气养血、扶正祛邪为主要原则,重视特定穴的使用,并根据脏腑经络辨证运用多种配穴方法以提高临床疗效。