期刊文献+
共找到678篇文章
< 1 2 34 >
每页显示 20 50 100
An Adaptive Privacy Preserving Framework for Distributed Association Rule Mining in Healthcare Databases
1
作者 Hasanien K.Kuba Mustafa A.Azzawi +2 位作者 Saad M.Darwish Oday A.Hassen Ansam A.Abdulhussein 《Computers, Materials & Continua》 SCIE EI 2023年第2期4119-4133,共15页
It is crucial,while using healthcare data,to assess the advantages of data privacy against the possible drawbacks.Data from several sources must be combined for use in many data mining applications.The medical practit... It is crucial,while using healthcare data,to assess the advantages of data privacy against the possible drawbacks.Data from several sources must be combined for use in many data mining applications.The medical practitioner may use the results of association rule mining performed on this aggregated data to better personalize patient care and implement preventive measures.Historically,numerous heuristics(e.g.,greedy search)and metaheuristics-based techniques(e.g.,evolutionary algorithm)have been created for the positive association rule in privacy preserving data mining(PPDM).When it comes to connecting seemingly unrelated diseases and drugs,negative association rules may be more informative than their positive counterparts.It is well-known that during negative association rules mining,a large number of uninteresting rules are formed,making this a difficult problem to tackle.In this research,we offer an adaptive method for negative association rule mining in vertically partitioned healthcare datasets that respects users’privacy.The applied approach dynamically determines the transactions to be interrupted for information hiding,as opposed to predefining them.This study introduces a novel method for addressing the problem of negative association rules in healthcare data mining,one that is based on the Tabu-genetic optimization paradigm.Tabu search is advantageous since it removes a huge number of unnecessary rules and item sets.Experiments using benchmark healthcare datasets prove that the discussed scheme outperforms state-of-the-art solutions in terms of decreasing side effects and data distortions,as measured by the indicator of hiding failure. 展开更多
关键词 distributed data mining evolutionary computation sanitization process healthcare informatics
下载PDF
Correlation knowledge extraction based on data mining for distribution network planning
2
作者 Zhifang Zhu Zihan Lin +4 位作者 Liping Chen Hong Dong Yanna Gao Xinyi Liang Jiahao Deng 《Global Energy Interconnection》 EI CSCD 2023年第4期485-492,共8页
Traditional distribution network planning relies on the professional knowledge of planners,especially when analyzing the correlations between the problems existing in the network and the crucial influencing factors.Th... Traditional distribution network planning relies on the professional knowledge of planners,especially when analyzing the correlations between the problems existing in the network and the crucial influencing factors.The inherent laws reflected by the historical data of the distribution network are ignored,which affects the objectivity of the planning scheme.In this study,to improve the efficiency and accuracy of distribution network planning,the characteristics of distribution network data were extracted using a data-mining technique,and correlation knowledge of existing problems in the network was obtained.A data-mining model based on correlation rules was established.The inputs of the model were the electrical characteristic indices screened using the gray correlation method.The Apriori algorithm was used to extract correlation knowledge from the operational data of the distribution network and obtain strong correlation rules.Degree of promotion and chi-square tests were used to verify the rationality of the strong correlation rules of the model output.In this study,the correlation relationship between heavy load or overload problems of distribution network feeders in different regions and related characteristic indices was determined,and the confidence of the correlation rules was obtained.These results can provide an effective basis for the formulation of a distribution network planning scheme. 展开更多
关键词 distribution network planning data mining Apriori algorithm Gray correlation analysis Chi-square test
下载PDF
A Distributed Data Mining System Based on Multi-agent Technology 被引量:1
3
作者 郭黎明 张艳珍 《Journal of Donghua University(English Edition)》 EI CAS 2006年第6期80-83,共4页
Distributed Data Mining is expected to discover preciously unknown, implicit and valuable information from massive data set inherently distributed over a network. In recent years several approaches to distributed data... Distributed Data Mining is expected to discover preciously unknown, implicit and valuable information from massive data set inherently distributed over a network. In recent years several approaches to distributed data mining have been developed, but only a few of them make use of intelligent agents. This paper provides the reason for applying Multi-Agent Technology in Distributed Data Mining and presents a Distributed Data Mining System based on Multi-Agent Technology that deals with heterogeneity in such environment. Based on the advantages of both the CS model and agent-based model, the system is being able to address the specific concern of increasing scalability and enhancing performance. 展开更多
关键词 分布式数据挖掘算法 多代理技术 服务器 计算机技术 信息处理
下载PDF
Research and Application of Distributed Data Mining Method for Improving Rural Power Grid Enterprises in Production and Operation Status Evaluation
4
作者 Gao Xiu-yun Xiang Wen Fang Jun-long 《Journal of Northeast Agricultural University(English Edition)》 CAS 2019年第2期87-96,共10页
With the reform of rural network enterprise system,the speed of transfer property rights in rural power enterprises is accelerated.The evaluation of the operation and development status of rural power enterprises is d... With the reform of rural network enterprise system,the speed of transfer property rights in rural power enterprises is accelerated.The evaluation of the operation and development status of rural power enterprises is directly related to the future development and investment direction of rural power enterprises.At present,the evaluation of the production and operation of rural network enterprises and the development status of power network only relies on the experience of the evaluation personnel,sets the reference index,and forms the evaluation results through artificial scoring.Due to the strong subjective consciousness of the evaluation results,the practical guiding significance is weak.Therefore,distributed data mining method in rural power enterprises status evaluation was proposed which had been applied in many fields,such as food science,economy or chemical industry.The distributed mathematical model was established by using principal component analysis(PCA)and regression analysis.By screening various technical indicators and determining their relevance,the reference value of evaluation results was improved.Combined with statistical program for social sciences(SPSS)data analysis software,the operation status of rural network enterprises was evaluated,and the rationality,effectiveness and economy of the evaluation was verified through comparison with current evaluation results and calculation examples of actual grid operation data. 展开更多
关键词 RURAL power grid PRODUCTION and management distributed data mining STATISTICAL program for SOCIAL sciences(SPSS19)
下载PDF
MA-IDS: A Distributed Intrusion Detection System Based on Data Mining
5
作者 SUNJian-hua JINHai CHENHao HANZong-fen 《Wuhan University Journal of Natural Sciences》 CAS 2005年第1期111-114,共4页
Aiming at the shortcomings in intrusion detection systems (IDSs) used incommercial and research fields, we propose the MA-IDS system, a distributed intrusion detectionsystem based on data mining. In this model, misuse... Aiming at the shortcomings in intrusion detection systems (IDSs) used incommercial and research fields, we propose the MA-IDS system, a distributed intrusion detectionsystem based on data mining. In this model, misuse intrusion detection system CM1DS) and anomalyintrusion de-lection system (AIDS) are combined. Data mining is applied to raise detectionperformance, and distributed mechanism is employed to increase the scalability and efficiency. Host-and network-based mining algorithms employ an improved. Bayes-ian decision theorem that suits forreal security environment to minimize the risks incurred by false decisions. We describe the overallarchitecture of the MA-IDS system, and discuss specific design and implementation issue. 展开更多
关键词 intrusion detection data mining distributed system
下载PDF
Designing a Model to Study Data Mining in Distributed Environment
6
作者 Md. Abadur Rahman Masud Karim 《Journal of Data Analysis and Information Processing》 2021年第1期23-29,共7页
To make business policy, market analysis, corporate decision, fraud detection, etc., we have to analyze and work with huge amount of data. Generally, such data are taken from different sources. Researchers are using d... To make business policy, market analysis, corporate decision, fraud detection, etc., we have to analyze and work with huge amount of data. Generally, such data are taken from different sources. Researchers are using data mining to perform such tasks. Data mining techniques are used to find hidden information from large data source. Data mining is using for various fields: Artificial intelligence, Bank, health and medical, corruption, legal issues, corporate business, marketing, etc. Special interest is given to associate rules, data mining algorithms, decision tree and distributed approach. Data is becoming larger and spreading geographically. So it is difficult to find better result from only a central data source. For knowledge discovery, we have to work with distributed database. On the other hand, security and privacy considerations are also another factor for de-motivation of working with centralized data. For this reason, distributed database is essential for future processing. In this paper, we have proposed a framework to study data mining in distributed environment. The paper presents a framework to bring out actionable knowledge. We have shown some level by which we can generate actionable knowledge. Possible tools and technique for these levels are discussed. 展开更多
关键词 data mining distributed database Knowledge Discovery Classification Algorithm
下载PDF
Research on Rolling Load Distribution Method based on Data Mining 被引量:1
7
作者 ZHANG Yan-hua LIU Xiang-hua WANG Guo-dong 《Journal of Iron and Steel Research(International)》 SCIE CAS CSCD 2005年第6期30-32,53,共4页
A new method of establishing rolling load distribution model was developed by online intelligent information-processing technology for plate rolling. The model combines knowledge model and mathematical model with usin... A new method of establishing rolling load distribution model was developed by online intelligent information-processing technology for plate rolling. The model combines knowledge model and mathematical model with using knowledge discovery in database (KDD) and data mining (DM) as the start. The online maintenance and optimization of the load model are realized. The effectiveness of this new method was testified by offline simulation and online application. 展开更多
关键词 rolling load distribution information processing knowledge discovery data mining
下载PDF
A Fast Distributed Algorithm for Association Rule Mining Based on Binary Coding Mapping Relation
8
作者 CHEN Geng NI Wei-wei +1 位作者 ZHU Yu-quan SUN Zhi-hui 《Wuhan University Journal of Natural Sciences》 EI CAS 2006年第1期27-30,共4页
Association rule mining is an important issue in data mining. The paper proposed an binary system based method to generate candidate frequent itemsets and corresponding supporting counts efficiently, which needs only ... Association rule mining is an important issue in data mining. The paper proposed an binary system based method to generate candidate frequent itemsets and corresponding supporting counts efficiently, which needs only some operations such as "and", "or" and "xor". Applying this idea in the existed distributed association rule mining al gorithm FDM, the improved algorithm BFDM is proposed. The theoretical analysis and experiment testify that BFDM is effective and efficient. 展开更多
关键词 frequent itemsets distributed association rule mining relation of itemsets-binary data
下载PDF
A New Approach for Knowledge Discovery in Distributed Databases Using Fragmented Data Storage Model
9
作者 Masoud Pesaran Behbahani Islam Choudhury Souheil Khaddaj 《Chinese Business Review》 2013年第12期834-845,共12页
关键词 数据存储模型 分布式数据库 知识发现 决策支持系统 智能化管理 分散 决策支持模型 智能方法
下载PDF
基于DDMINER分布式数据库系统中频繁项目集的更新 被引量:15
10
作者 吉根林 杨明 +1 位作者 赵斌 孙志挥 《计算机学报》 EI CSCD 北大核心 2003年第10期1387-1392,共6页
给出了一种分布式数据挖掘系统的体系结构DDMINER ,对分布式数据库系统中频繁项目集的更新问题进行探讨 ,既考虑了数据库中事务增加的情况 ,又考虑了事务删除的情况 ;提出了一种基于DDMINER的局部频繁项目集的更新算法ULF和全局频繁项... 给出了一种分布式数据挖掘系统的体系结构DDMINER ,对分布式数据库系统中频繁项目集的更新问题进行探讨 ,既考虑了数据库中事务增加的情况 ,又考虑了事务删除的情况 ;提出了一种基于DDMINER的局部频繁项目集的更新算法ULF和全局频繁项目集的更新算法UGF .该算法能够产生较少数量的候选频繁项目集 ,在求解全局频繁项目集过程中 ,传送候选局部频繁项目集支持数的通信量为O(n) ;将文章提出的算法用Java语言加以实现 ,并对算法性能进行了研究 ;实验结果表明这些算法是正确、可行的 ,并且具有较高的效率. 展开更多
关键词 分布式数据库系统 频繁项目集 分布式数据挖掘系统 体系结构 ddmINER
下载PDF
Web Page Recommendation Using Distributional Recurrent Neural Network
11
作者 Chaithra G.M.Lingaraju S.Jagannatha 《Computer Systems Science & Engineering》 SCIE EI 2023年第4期803-817,共15页
In the data retrieval process of the Data recommendation system,the matching prediction and similarity identification take place a major role in the ontology.In that,there are several methods to improve the retrieving... In the data retrieval process of the Data recommendation system,the matching prediction and similarity identification take place a major role in the ontology.In that,there are several methods to improve the retrieving process with improved accuracy and to reduce the searching time.Since,in the data recommendation system,this type of data searching becomes complex to search for the best matching for given query data and fails in the accuracy of the query recommendation process.To improve the performance of data validation,this paper proposed a novel model of data similarity estimation and clustering method to retrieve the relevant data with the best matching in the big data processing.In this paper advanced model of the Logarithmic Directionality Texture Pattern(LDTP)method with a Metaheuristic Pattern Searching(MPS)system was used to estimate the similarity between the query data in the entire database.The overall work was implemented for the application of the data recommendation process.These are all indexed and grouped as a cluster to form a paged format of database structure which can reduce the computation time while at the searching period.Also,with the help of a neural network,the relevancies of feature attributes in the database are predicted,and the matching index was sorted to provide the recommended data for given query data.This was achieved by using the Distributional Recurrent Neural Network(DRNN).This is an enhanced model of Neural Network technology to find the relevancy based on the correlation factor of the feature set.The training process of the DRNN classifier was carried out by estimating the correlation factor of the attributes of the dataset.These are formed as clusters and paged with proper indexing based on the MPS parameter of similarity metric.The overall performance of the proposed work can be evaluated by varying the size of the training database by 60%,70%,and 80%.The parameters that are considered for performance analysis are Precision,Recall,F1-score and the accuracy of data retrieval,the query recommendation output,and comparison with other state-of-art methods. 展开更多
关键词 ONTOLOGY data mining in big data logarithmic directionality texture pattern metaheuristic pattern searching system distributional recurrent neural network query recommendation
下载PDF
面向密度分布不均数据的加权逆近邻密度峰值聚类算法
12
作者 吕莉 陈威 +2 位作者 肖人彬 韩龙哲 谭德坤 《智能系统学报》 CSCD 北大核心 2024年第1期165-175,共11页
针对密度分布不均数据,密度峰值聚类算法易忽略类簇间样本的疏密差异,导致误选类簇中心;分配策略易将稀疏区域的样本误分到密集区域,导致聚类效果不佳的问题,本文提出一种面向密度分布不均数据的加权逆近邻密度峰值聚类算法。该算法首... 针对密度分布不均数据,密度峰值聚类算法易忽略类簇间样本的疏密差异,导致误选类簇中心;分配策略易将稀疏区域的样本误分到密集区域,导致聚类效果不佳的问题,本文提出一种面向密度分布不均数据的加权逆近邻密度峰值聚类算法。该算法首先在局部密度公式中引入基于sigmoid函数的权重系数,增加稀疏区域样本的权重,结合逆近邻思想,重新定义了样本的局部密度,有效提升类簇中心的识别率;其次,引入改进的样本相似度策略,利用样本间的逆近邻及共享逆近邻信息,使得同一类簇样本间具有较高的相似度,可有效改善稀疏区域样本分配错误的问题。在密度分布不均、复杂形态和UCI数据集上的对比实验表明,本文算法的聚类效果优于IDPC-FA、FNDPC、FKNN-DPC、DPC和DPCSA算法。 展开更多
关键词 密度峰值聚类 密度分布不均 逆近邻 共享逆近邻 样本相似度 局部密度 分配策略 数据挖掘
下载PDF
基于动态R-树结构的DDM区域匹配算法 被引量:5
13
作者 王磊 张慧慧 +1 位作者 李开生 鞠鸿彬 《计算机工程》 CAS CSCD 北大核心 2008年第3期56-58,共3页
分析了传统数据分发管理(DDM)匹配方法,结合空间索引技术的特点,提出了一种动态R-树区域匹配方法,通过建立R-树对公布和订购区域进行组织管理,并在R-树上实现订购区域与公布区域的匹配搜索。仿真实验结果表明,选取适当参数M,可减少动态R... 分析了传统数据分发管理(DDM)匹配方法,结合空间索引技术的特点,提出了一种动态R-树区域匹配方法,通过建立R-树对公布和订购区域进行组织管理,并在R-树上实现订购区域与公布区域的匹配搜索。仿真实验结果表明,选取适当参数M,可减少动态R-树DDM匹配算法的时间开销,达到较优性能。 展开更多
关键词 R-树 数据分发管理(ddm) 区域匹配 空间索引 高层体系结构
下载PDF
一种基于历史的DDM实现方法 被引量:2
14
作者 万江华 史军慧 +2 位作者 姚益平 卢锡城 时向泉 《计算机工程与应用》 CSCD 北大核心 2001年第15期80-81,87,共3页
DDM是HLA接口规范中定义的六大服务之一。它允许盟员声明其在路由空间中的更新区域和定购区域,从而达到减少HLA盟员间数据交换量的目的。它是RTI服务实现效率的关键,其核心问题是如何减少需要匹配的区域,以减少计算量。文章简要介绍... DDM是HLA接口规范中定义的六大服务之一。它允许盟员声明其在路由空间中的更新区域和定购区域,从而达到减少HLA盟员间数据交换量的目的。它是RTI服务实现效率的关键,其核心问题是如何减少需要匹配的区域,以减少计算量。文章简要介绍了现有的几种方法,并分析了其各自的特点,在此基础上提出了一种基于历史的DDM方法。 展开更多
关键词 高层体系结构 数据分布管理 路由空间 分布式交互仿真
下载PDF
基于MapReduce的大规模网络社区发现算法
15
作者 王瀚橙 戴海鹏 +2 位作者 陈志鹏 陈树森 陈贵海 《计算机科学》 CSCD 北大核心 2024年第4期11-18,共8页
社区发现是社会网络挖掘领域的基本问题。随着海量数据的迅速产生,传统社区发现算法愈发难以处理大规模社会网络。因此,针对大规模网络设计高效的社区发现算法意义重大。文中提出了一种基于MapReduce和k中心聚类的新型分布式算法。首先... 社区发现是社会网络挖掘领域的基本问题。随着海量数据的迅速产生,传统社区发现算法愈发难以处理大规模社会网络。因此,针对大规模网络设计高效的社区发现算法意义重大。文中提出了一种基于MapReduce和k中心聚类的新型分布式算法。首先,该算法提出“朋友圈系数”技术,该技术可更加准确地度量结点间的距离。其次,该算法提出“两阶段k中心聚类”技术,该技术在选取中心点过程中融入结点中心度启发式信息,可显著优化输出结果的模块度。最后,该算法提出“以模块度为优化目标的社区融合”技术,该技术能够在无先验知识的前提下自动确定网络中的社区数目。实验结果表明,所提算法的社区发现结果模块度明显优于最先进的社区发现算法。例如,相比LPA算法,其将模块度平均提升9.19倍。 展开更多
关键词 社区发现 k中心聚类 分布式计算 数据挖掘 大数据
下载PDF
一种改进排序匹配算法在DDM中的应用与实现 被引量:2
16
作者 王磊 张慧慧 +1 位作者 李开生 鞠鸿彬 《计算机工程与应用》 CSCD 北大核心 2007年第33期161-163,210,共4页
数据分发管理功能是降低网络冗余数据的有效手段,它是实现HLA-RTI的关键技术。结合IEEE1516介绍了数据分发管理过滤机制以及传统的匹配方法,在分析排序算法匹配原理的基础上,给出了排序算法实现订购区域与公布区域的匹配策略,针对排序... 数据分发管理功能是降低网络冗余数据的有效手段,它是实现HLA-RTI的关键技术。结合IEEE1516介绍了数据分发管理过滤机制以及传统的匹配方法,在分析排序算法匹配原理的基础上,给出了排序算法实现订购区域与公布区域的匹配策略,针对排序算法在区域数目较大时出现的运行时间长、存储空间占用大的弊端,提出了一种改进的排序算法。通过仿真实验表明改进后的排序算法在区域数目较大时所需的时间开销较少,并且在区域边长发生变化的情况下具有较好的平稳性。 展开更多
关键词 数据分发管理(ddm) 高层体系结构(HLA) 运行时间支撑结构(RTI) 排序算法 公布/订购
下载PDF
一种混合的动态DDM实现方法 被引量:2
17
作者 张霞 黄莎白 《计算机工程》 CAS CSCD 北大核心 2003年第20期14-15,179,共3页
介绍了HLA中数据分发管理DDM的基本内容和过程,分析了目前两种经典的DDM实现方法;在此基础上综合了现有方法的优点,提出了一种混合的动态的DDM实现方法,提高了区域匹配的精度,降低了网络资源的消耗,对DDM方法进行了改进。
关键词 高层体系结构 HLA 数据分发管理 ddm 区域匹配 组播分配 分布交互式仿真
下载PDF
基于MapReduce的H-mine算法 被引量:3
18
作者 冯兴杰 赵杰 《计算机应用研究》 CSCD 北大核心 2016年第3期754-758,共5页
频繁模式挖掘是一种非常有效地从数据中获取知识的方法,但是随着大数据时代的来临,现有算法及其计算环境的运算速度、内外存容量面临严峻挑战。针对以上问题,紧密结合MapReduce模型提供的高效分布式编程和运行框架,在深入分析H-mine频... 频繁模式挖掘是一种非常有效地从数据中获取知识的方法,但是随着大数据时代的来临,现有算法及其计算环境的运算速度、内外存容量面临严峻挑战。针对以上问题,紧密结合MapReduce模型提供的高效分布式编程和运行框架,在深入分析H-mine频繁模式挖掘算法的基础上,通过对H-mine算法频繁模式挖掘过程的并行化改进,提出了一种新颖的基于MapReduce模型的H-mine算法(简称MRH-mine)。MRH-mine算法实现了对H-mine算法在分布式运行环境下的改造,实验表明该算法在面对数据大规模增长的情况下具有良好的性能和扩展性。 展开更多
关键词 分布式数据挖掘 MAPREDUCE H-mine 并行化 HADOOP
下载PDF
KD-RTI/DDM中的组播分配策略 被引量:1
19
作者 曲庆军 刘秀罗 +1 位作者 吴作顺 黄柯棣 《国防科技大学学报》 EI CAS CSCD 北大核心 2003年第2期79-83,共5页
组播技术的应用可以有效提高RTI的实现性能。介绍了组播分配的一般理论和实现方案,详细展示了KD RTI中的组播分配策略,讨论了组播应用过程中出现的问题,同时给出了解决方法。
关键词 高层体系结构 数据分发管理 运行时间支撑结构 区域 兴趣管理 组播
下载PDF
分布式网络环境下DDM系统层次仿真框架 被引量:1
20
作者 王磊 李开生 +1 位作者 张慧慧 鞠鸿彬 《系统仿真学报》 EI CAS CSCD 北大核心 2007年第20期4689-4693,4834,共6页
在基于HLA的大规模交互仿真中,如何灵活、高效地实现分布式网络仿真成为迫切需要解决的问题。HLA中的数据分发管理机制为实现大规模交互仿真提供了可能。分析了数据分发实现过程及过滤原理,结合分布式网络的特点,构建了一种层次化数据... 在基于HLA的大规模交互仿真中,如何灵活、高效地实现分布式网络仿真成为迫切需要解决的问题。HLA中的数据分发管理机制为实现大规模交互仿真提供了可能。分析了数据分发实现过程及过滤原理,结合分布式网络的特点,构建了一种层次化数据分发管理系统仿真结构,提出了层次化DDM系统仿真策略,并对层次化DDM仿真系统的性能进行了分析对比,说明了层次化仿真框架的可行性及优越性。 展开更多
关键词 高层体系结构 数据分发管理 层次化仿真框架 区域匹配 组播组分配
下载PDF
上一页 1 2 34 下一页 到第
使用帮助 返回顶部