期刊文献+
共找到899篇文章
< 1 2 45 >
每页显示 20 50 100
基于模式语义的非膨胀性Pattern Tree简化
1
作者 邓娜 王宇 +3 位作者 黄炜 李博 齐耀龙 高丽敏 《计算机工程与应用》 CSCD 北大核心 2006年第17期149-152,187,共5页
近年来,XQuery逐渐成为XML查询语言的事实标准。PatternTree被广泛应用于XQuery查询处理。PatternTree用匹配方法处理查询,其本身的大小和复杂程度决定了匹配的效率。该文论述了利用模式语义简化PatternTree的规则。这种方法避免了因插... 近年来,XQuery逐渐成为XML查询语言的事实标准。PatternTree被广泛应用于XQuery查询处理。PatternTree用匹配方法处理查询,其本身的大小和复杂程度决定了匹配的效率。该文论述了利用模式语义简化PatternTree的规则。这种方法避免了因插入冗余节点而导致PatternTree的膨胀。三个判断冗余节点的规则不但可以判断叶节点的冗余,而且可以在保留叶节点的情况下,判断非叶节点的冗余。实验证明了方法的可行性和有效性。 展开更多
关键词 Pattem tree 简化 XML
下载PDF
Pattern Tree最小简单分解策略
2
作者 邓娜 王宇 +3 位作者 范士勇 齐鸿志 安海宁 肖胜刚 《计算机工程与应用》 CSCD 北大核心 2006年第18期160-163,共4页
PatternTree(PTQ)是复杂的树状结构,求解PTQ需要在XML文档集合中查找与之匹配的数据。在PTQ中,只有部分节点是需要输出的查询目标节点,其余节点则是中间结果。因此,如何在求解过程中尽量避免中间结果的产生,是XML查询优化面临的一个关... PatternTree(PTQ)是复杂的树状结构,求解PTQ需要在XML文档集合中查找与之匹配的数据。在PTQ中,只有部分节点是需要输出的查询目标节点,其余节点则是中间结果。因此,如何在求解过程中尽量避免中间结果的产生,是XML查询优化面临的一个关键问题。论文提出了一种PTQ分解策略,能够根据不同查询操作的粒度,将PTQ分解为可用一种内部操作实现的最大片断集合。从而有效的限制中间结果的产生,提高查询效率。 展开更多
关键词 Pattem tree 分解 XML
下载PDF
Improved Pattern Tree for Incremental Frequent-Pattern Mining 被引量:1
3
作者 周明 王太勇 《Transactions of Tianjin University》 EI CAS 2010年第2期129-134,共6页
By analyzing the existing prefix-tree data structure, an improved pattern tree was introduced for processing new transactions. It firstly stored transactions in a lexicographic order tree and then restructured the tre... By analyzing the existing prefix-tree data structure, an improved pattern tree was introduced for processing new transactions. It firstly stored transactions in a lexicographic order tree and then restructured the tree by sorting each path in a frequency-descending order. While updating the improved pattern tree, there was no need to rescan the entire new database or reconstruct a new tree for incremental updating. A test was performed on synthetic dataset T10I4D100K with 100,000 transactions and 870 items. Experimental results show that the smaller the minimum support threshold, the faster the improved pattern tree achieves over CanTree for all datasets. As the minimum support threshold increased from 2% to 3.5%, the runtime decreased from 452.71 s to 186.26 s. Meanwhile, the runtime required by CanTree decreased from 1,367.03 s to 432.19 s. When the database was updated, the execution time of im- proved pattern tree consisted of construction of original improved pattern trees and reconstruction of initial tree. The experiment results showed that the runtime was saved by about 15% compared with that of CanTree. As the number of transactions increased, the runtime of improved pattern tree was about 25% shorter than that of FP-tree. The improved pattern tree also required less memory than CanTree. 展开更多
关键词 增量更新 挖掘模式 数据结构 电源端口 即时通讯 执行时间 数据库 数据集
下载PDF
Association RuleMining Frequent-Pattern-Based Intrusion Detection in Network
4
作者 S.Sivanantham V.Mohanraj +1 位作者 Y.Suresh J.Senthilkumar 《Computer Systems Science & Engineering》 SCIE EI 2023年第2期1617-1631,共15页
In the network security system,intrusion detection plays a significant role.The network security system detects the malicious actions in the network and also conforms the availability,integrity and confidentiality of da... In the network security system,intrusion detection plays a significant role.The network security system detects the malicious actions in the network and also conforms the availability,integrity and confidentiality of data informa-tion resources.Intrusion identification system can easily detect the false positive alerts.If large number of false positive alerts are created then it makes intrusion detection system as difficult to differentiate the false positive alerts from genuine attacks.Many research works have been done.The issues in the existing algo-rithms are more memory space and need more time to execute the transactions of records.This paper proposes a novel framework of network security Intrusion Detection System(IDS)using Modified Frequent Pattern(MFP-Tree)via K-means algorithm.The accuracy rate of Modified Frequent Pattern Tree(MFPT)-K means method infinding the various attacks are Normal 94.89%,for DoS based attack 98.34%,for User to Root(U2R)attacks got 96.73%,Remote to Local(R2L)got 95.89%and Probe attack got 92.67%and is optimal when it is compared with other existing algorithms of K-Means and APRIORI. 展开更多
关键词 IDS K-MEANS frequent pattern tree false alert MINING L1-norm
下载PDF
Introducing tree neighbouring relationship factors in forest pattern spatial analysis:weighted Delaunay triangulation method 被引量:4
5
作者 Jianjun Li Kaiwen Zhu +4 位作者 Shuai Liu Dan Li Gui Zhang Xing Liu Weiming Yang 《Journal of Forestry Research》 SCIE CAS CSCD 2021年第5期1941-1951,共11页
It is important to quantify and analyze forest spatial patterns for studying biological characteristics,population interaction and the relationship between the population and environment.In this study,the forest spati... It is important to quantify and analyze forest spatial patterns for studying biological characteristics,population interaction and the relationship between the population and environment.In this study,the forest spatial structure unit was generated based on the Delaunay triangulation model(DTM),and the weights were generated using the comprehensive values of the tree diameter at breast height,total height and crown width.The distance between neighbors determined by the DTM was weighted to transform the original coordinates of trees into logical coordinates.Then,a weighted spatial pattern(WSP)was developed.After weighting,the neighboring trees were replaced,the replacement ratio was 38.3%,and there was 57.4%of the central tree.Correlation analysis showed that the uniform angle index of the WSP was significantly correlated with the tree size standard deviation under uniformity(r=0.932)and randomness(r=0.711).The DTM method not only considers the spatial distance between trees,but also considers the non-spatial attributes of trees.By changing the spatial topological relation between trees,this method further improves the spatial structure measurement of forest. 展开更多
关键词 Delaunay triangulation network model Weighting tree spatial pattern tree size standard deviation
下载PDF
Generating IDS Attack Pattern Automatically Based on Attack Tree 被引量:1
6
作者 向尕 曹元大 《Journal of Beijing Institute of Technology》 EI CAS 2003年第2期138-142,共5页
Generating attack pattern automatically based on attack tree is studied. The extending definition of attack tree is proposed. And the algorithm of generating attack tree is presented. The method of generating attack p... Generating attack pattern automatically based on attack tree is studied. The extending definition of attack tree is proposed. And the algorithm of generating attack tree is presented. The method of generating attack pattern automatically based on attack tree is shown, which is tested by concrete attack instances. The results show that the algorithm is effective and efficient. In doing so, the efficiency of generating attack pattern is improved and the attack trees can be reused. 展开更多
关键词 attack tree attack pattern IDS (intrusion detection system)
下载PDF
Mining Maximal Frequent Patterns in a Unidirectional FP-tree 被引量:1
7
作者 宋晶晶 刘瑞新 +1 位作者 王艳 姜保庆 《Journal of Donghua University(English Edition)》 EI CAS 2006年第6期105-109,共5页
Because mining complete set of frequent patterns from dense database could be impractical, an interesting alternative has been proposed recently. Instead of mining the complete set of frequent patterns, the new model ... Because mining complete set of frequent patterns from dense database could be impractical, an interesting alternative has been proposed recently. Instead of mining the complete set of frequent patterns, the new model only finds out the maximal frequent patterns, which can generate all frequent patterns. FP-growth algorithm is one of the most efficient frequent-pattern mining methods published so far. However, because FP-tree and conditional FP-trees must be two-way traversable, a great deal memory is needed in process of mining. This paper proposes an efficient algorithm Unid_FP-Max for mining maximal frequent patterns based on unidirectional FP-tree. Because of generation method of unidirectional FP-tree and conditional unidirectional FP-trees, the algorithm reduces the space consumption to the fullest extent. With the development of two techniques: single path pruning and header table pruning which can cut down many conditional unidirectional FP-trees generated recursively in mining process, Unid_FP-Max further lowers the expense of time and space. 展开更多
关键词 数据库 计算机技术 数据处理 信息技术 数据挖掘
下载PDF
Study on Bush Tree Growth in XLPE Using 3D PD Patterns
8
作者 吴广宇 童晓阳 《Journal of Modern Transportation》 2000年第2期152-161,共10页
Exploitation of equipment with cross linked polyethylene (XLPE ) insulation requires its condition monitoring and diagnostic. Traditionally diagnostics of insulation is carried out by means of partial discharge detect... Exploitation of equipment with cross linked polyethylene (XLPE ) insulation requires its condition monitoring and diagnostic. Traditionally diagnostics of insulation is carried out by means of partial discharge detection. However, such identification of a defect, for example, void, inclusion or treeing, does not say about its danger from a point of view of full insulation gap breakdown and insulation construction failure. For this purpose a 29 kV CN-CV cable sample is studied. The experiment is based on research for determination of the dependencies between PD characteristics in XLPE upon time and three dimension PD patterns of corresponding treeing. The investigations were carried out by means of electrical measurement of PD current and simultaneous optical recording of treeing image. The needleplane electrode is applied as the electrode. As a result, -q-n PD patterns which are used as the bases to bush tree initialization and growth can be obtained. Test results show that PD pattern recognition can be applied as a powerful tool for recognizing electrical tree initialization and growth. This can make a good basis for on-line condition monitoring of high voltage power cable. 展开更多
关键词 partial discharge bush tree pattern recognition condition monitoring
下载PDF
A Tree Pattern Matching Algorithm for XML Queries with Structural Preferences
9
作者 Maurice Tchoupé Tchendji Lionel Tadonfouet Thomas Tébougang Tchendji 《Journal of Computer and Communications》 2019年第1期61-83,共23页
In the XML community, exact queries allow users to specify exactly what they want to check and/or retrieve in an XML document. When they are applied to a semi-structured document or to a document with an overly comple... In the XML community, exact queries allow users to specify exactly what they want to check and/or retrieve in an XML document. When they are applied to a semi-structured document or to a document with an overly complex model, the lack or the ignorance of the explicit document model (DTD—Document Type Definition, Schema, etc.) increases the risk of obtaining an empty result set when the query is too specific, or, too large result set when it is too vague (e.g. it contains wildcards such as “*”). The reason is that in both cases, users write queries according to the document model they have in mind;this can be very far from the one that can actually be extracted from the document. Opposed to exact queries, preference queries are more flexible and can be relaxed to expand the search space during their evaluations. Indeed, during their evaluation, certain constraints (the preferences they contain) can be relaxed if necessary to avoid precisely empty results;moreover, the returned answers can be filtered to retain only the best ones. This paper presents an algorithm for evaluating such queries inspired by the TreeMatch algorithm proposed by Yao et al. for exact queries. In the proposed algorithm, the best answers are obtained by using an adaptation of the Skyline operator (defined in relational databases) in the context of documents (trees) to incrementally filter into the partial solutions set, those which satisfy the maximum of preferential constraints. The only restriction imposed on documents is No-Self-Containment. 展开更多
关键词 SEMI-STRUCTURED Documents Preference QUERIES tree pattern Matching treeMatch Algorithm XML The SKYLINE Operator
下载PDF
Ethnographic Decision Tree Modeling of the Decision Criteria and Decision Patterns for Adult Married Women with Unexpected Pregnancies
10
作者 Yu-Chan Li Yieh Loong Tsai Pei-Jung Lan 《Open Journal of Obstetrics and Gynecology》 2017年第10期1052-1063,共12页
Introduction: As far as adult and married women were concerned, when they occurred to “unplanned pregnancy”, they felt so surprised and concussive all the time. Besides, the unplanned pregnancy also affects the othe... Introduction: As far as adult and married women were concerned, when they occurred to “unplanned pregnancy”, they felt so surprised and concussive all the time. Besides, the unplanned pregnancy also affects the other members in the family system. Therefore, when married women have to face the choice: “birth” or “abortion”, they’ll consider lots of thoughts and different decision criteria and decision pattern under various influences on physician, mind, mental and society. The purpose of this study was to investigate the criteria considered and the decision patterns involved when adult married women decide whether to terminate or continue an unplanned pregnancy. Methods: The study uses the method—“Ethnographic Decision Tree Modeling” [1] to build model of the decision criteria and decision patterns involved when adult married women make a decision about their unplanned pregnancy. There are three process in the research method: “Pilot Study”—interview two groups, every group distinct 4 married adult women with unplanned pregnancies, which decide whether to terminate or continue an unplanned pregnancy, what is the items of decision characters affect to the choice: “birth” or “abortion”. “Building of the Model”, displays the importance in proper order of those items and build the modeling with these two groups of women. “Testing of the Model”: investigate the criteria considered and the decision patterns involved when adult married women decide whether to terminate or continue an unplanned pregnancy. The study interviewed 34 married adult women with 43 unplanned pregnancies totally. Results: The result of the study finds out 12 items of decision characters, including planning to get pregnant or not, stability of feelings for married partner, the points of view on life, was affected by mother, mother-in-law, an husband’s emphasis on male, the meanings of children, the financial burden, the plan an assignment of career and time, the past pregnant experiences, the status of raising children, the health of parents and fetus, the effect of living environment, and social and cultural vision. Besides, there are four decision patterns of married adult women with unplanned pregnancy are “receiving abortion positively”;“giving birth as long as getting pregnancy naturally”;“ the minds are hesitative and changeable”, and “being forced by important others.” Conclusion: By setting the decision model tree, we found several decision criteria and patterns, and possible modes actions to be taken, could offer to see the adult married women’s decision-making and struggles in mind about unplanned pregnancy. 展开更多
关键词 Ethnographic DECISION tree Modeling ADULT and MARRIED Women UNPLANNED Pregnancy DECISION pattern
下载PDF
基于FP-Tree的最大频繁项目集挖掘及更新算法 被引量:164
11
作者 宋余庆 朱玉全 +1 位作者 孙志挥 陈耿 《软件学报》 EI CSCD 北大核心 2003年第9期1586-1592,共7页
挖掘最大频繁项目集是多种数据挖掘应用中的关键问题,之前的很多研究都是采用Apriori类的候选项目集生成-检验方法.然而,候选项目集产生的代价是很高的,尤其是在存在大量强模式和/或长模式的时候.提出了一种快速的基于频繁模式树(FP-tr... 挖掘最大频繁项目集是多种数据挖掘应用中的关键问题,之前的很多研究都是采用Apriori类的候选项目集生成-检验方法.然而,候选项目集产生的代价是很高的,尤其是在存在大量强模式和/或长模式的时候.提出了一种快速的基于频繁模式树(FP-tree)的最大频繁项目集挖掘DMFIA(discover maximum frequent itemsets algorithm)及其更新算法UMFIA(update maximum frequent itemsets algorithm).算法UMFIA将充分利用以前的挖掘结果来减少在更新的数据库中发现新的最大频繁项目集的费用. 展开更多
关键词 数据挖掘 最大频繁项目集 关联规则 频繁模式树 增量式更新
下载PDF
基于FP-tree和约束概念格的关联规则挖掘算法及应用研究 被引量:25
12
作者 付冬梅 王志强 《计算机应用研究》 CSCD 北大核心 2014年第4期1013-1015,1019,共4页
现有的基于频繁模式树FP-tree和概念格的规则挖掘算法在构造概念格时存在重复遍历FP-tree问题,在挖掘后件约束的规则时算法构造的概念格包含冗余结点。针对这两个问题,提出了通过遍历FP-tree生成候选概念格节点的策略,并根据候选概念格... 现有的基于频繁模式树FP-tree和概念格的规则挖掘算法在构造概念格时存在重复遍历FP-tree问题,在挖掘后件约束的规则时算法构造的概念格包含冗余结点。针对这两个问题,提出了通过遍历FP-tree生成候选概念格节点的策略,并根据候选概念格节点进一步构造规则约束条件下无冗余概念格。通过实际项目中大气腐蚀数据进行算法的应用,结果表明该算法比现有算法具有更高的挖掘效率且腐蚀规则结果对材料腐蚀现状研究具有重要指导价值。 展开更多
关键词 规则挖掘 频繁模式树 约束概念格 材料腐蚀
下载PDF
基于FP-tree的最大频繁模式挖掘算法 被引量:18
13
作者 冯志新 钟诚 《计算机工程》 CAS CSCD 北大核心 2004年第11期123-124,共2页
在FP-tree结构的基础上提出了最大频繁模式挖掘算法FP-Max。算法FP-Max只需要两次数据库扫描,挖掘过程不会产生候选项集。实验表明,算法FP-Max在挖掘密集型数据集方面是高效的。
关键词 数据挖掘 关联规则 频繁模式树 最大频繁模式
下载PDF
基于OWSFP-Tree的最大频繁项目集挖掘算法 被引量:5
14
作者 赵志刚 王芳 万军 《计算机工程与设计》 CSCD 北大核心 2013年第5期1687-1690,1807,共5页
为了解决FP-Max算法挖掘最大频繁项目集时递归生成条件模式树和超集检验的问题,提出了基于单向有序FP-Tree的NCFP-Max算法。该算法在挖掘过程中采用预剪枝策略减少挖掘结点,利用单向有序FP-Tree避免每次存储当前挖掘出的频繁项目集之前... 为了解决FP-Max算法挖掘最大频繁项目集时递归生成条件模式树和超集检验的问题,提出了基于单向有序FP-Tree的NCFP-Max算法。该算法在挖掘过程中采用预剪枝策略减少挖掘结点,利用单向有序FP-Tree避免每次存储当前挖掘出的频繁项目集之前都需要超集检验,利用项目表格避免递归生成条件模式树减少时空资源。实验结果表明,在事务条数多、项的数量大的情况下,NCFP-Max算法的挖掘时间比FP-Max算法缩短了50%左右。 展开更多
关键词 数据挖掘 关联规则 最大频繁项目集 FP-tree 条件模式树
下载PDF
基于改进FP-tree的最大频繁项集挖掘算法 被引量:14
15
作者 陈晨 鞠时光 《计算机工程与设计》 CSCD 北大核心 2008年第24期6236-6239,共4页
现有的最大频繁项集挖掘算法在挖掘过程中需要进行超集检测,基于FP-tree的算法需要递归的建立条件频繁模式树,挖掘效率不高。提出了一种基于改进FP-tree高效挖掘最大频繁项集的算法(MMFI)。该算法修改了FP-tree结构并采用NBN策略,在挖... 现有的最大频繁项集挖掘算法在挖掘过程中需要进行超集检测,基于FP-tree的算法需要递归的建立条件频繁模式树,挖掘效率不高。提出了一种基于改进FP-tree高效挖掘最大频繁项集的算法(MMFI)。该算法修改了FP-tree结构并采用NBN策略,在挖掘过程中既不需要进行超集检测也不需要递归的建立条件频繁模式树。算法分析和实验结果表明,该算法是一种有效、快速的算法。 展开更多
关键词 数据挖掘 关联规则 最大频繁项集 超集检测 频繁模式树
下载PDF
基于FP-Tree的快速选择性集成算法 被引量:6
16
作者 赵强利 蒋艳凰 徐明 《软件学报》 EI CSCD 北大核心 2011年第4期709-721,共13页
选择性集成通过选择部分基分类器参与集成,从而提高集成分类器的泛化能力,降低预测开销.但已有的选择性集成算法普遍耗时较长,将数据挖掘的技术应用于选择性集成,提出一种基于FP-Tree(frequent pattern tree)的快速选择性集成算法:CPM-E... 选择性集成通过选择部分基分类器参与集成,从而提高集成分类器的泛化能力,降低预测开销.但已有的选择性集成算法普遍耗时较长,将数据挖掘的技术应用于选择性集成,提出一种基于FP-Tree(frequent pattern tree)的快速选择性集成算法:CPM-EP(coverage based pattern mining for ensemble pruning).该算法将基分类器对校验样本集的分类结果组织成一个事务数据库,从而使选择性集成问题可转化为对事务数据集的处理问题.针对所有可能的集成分类器大小,CPM-EP算法首先得到一个精简的事务数据库,并创建一棵FP-Tree树保存其内容;然后,基于该FP-Tree获得相应大小的集成分类器.在获得的所有集成分类器中,对校验样本集预测精度最高的集成分类器即为算法的输出.实验结果表明,CPM-EP算法以很低的计算开销获得优越的泛化能力,其分类器选择时间约为GASEN的1/19以及Forward-Selection的1/8,其泛化能力显著优于参与比较的其他方法,而且产生的集成分类器具有较少的基分类器. 展开更多
关键词 集成学习 选择性集成 频繁模式树 BAGGING 误差反向传播神经网络
下载PDF
基于FP-tree的最大频繁项目集挖掘算法 被引量:4
17
作者 马丽生 邓辉文 齐逸 《计算机工程与设计》 CSCD 北大核心 2008年第2期385-388,共4页
最大频繁项目集挖掘是数据挖掘领域最重要的基本问题之一,在分析已有算法的基础上提出了FP-MMFI算法,它是对FP-growth算法在最大频繁项目集挖掘上的扩展。提出了频繁路径的概念,用它可以有效地对FP-tree进行压缩和缩小搜索空间,同时使... 最大频繁项目集挖掘是数据挖掘领域最重要的基本问题之一,在分析已有算法的基础上提出了FP-MMFI算法,它是对FP-growth算法在最大频繁项目集挖掘上的扩展。提出了频繁路径的概念,用它可以有效地对FP-tree进行压缩和缩小搜索空间,同时使用投影的方法对超集检测进行了优化,减少了项目匹配的次数。最后实验结果表明,该算法在性能上优于已有的同类算法。 展开更多
关键词 数据挖掘 关联规则 频繁项目集 最大频繁项目集 频繁模式树
下载PDF
针对XML流数据的复杂Twig Pattern查询处理 被引量:9
18
作者 杨卫东 王清明 施伯乐 《软件学报》 EI CSCD 北大核心 2007年第4期893-904,共12页
XML流数据处理在研究领域引起了研究者的广泛兴趣.针对XML流数据的、具有嵌套AND/OR谓词的复杂Twig Pattern查询处理,提出一种新方法.为了提高查询处理性能,将所有Twig Pattern合并为一个共享前缀的查询树,其中,AND/OR谓词被表示为单独... XML流数据处理在研究领域引起了研究者的广泛兴趣.针对XML流数据的、具有嵌套AND/OR谓词的复杂Twig Pattern查询处理,提出一种新方法.为了提高查询处理性能,将所有Twig Pattern合并为一个共享前缀的查询树,其中,AND/OR谓词被表示为单独的抽象语法树,因而能够以文档顺序、单遍地处理复杂Twig Pattern的匹配,并避免了YFilter中对嵌套谓词进行后置处理所产生的中间结果.实验结果表明,该方法能够有效改善Twig Pattern的处理性能,尤其是在处理大文档的情况下.基于已有的研究工作,讨论如何利用DTD(document type definition)的结构和约束信息优化Twig Pattern,即这种优化是在系统运行前进行的预处理. 展开更多
关键词 XML文档流 xPam TWIG pattern 查询树 DTD(document type definition)
下载PDF
基于FP-Tree的最大频繁项目集更新挖掘算法 被引量:5
19
作者 杨君锐 赵群礼 《华中科技大学学报(自然科学版)》 EI CAS CSCD 北大核心 2004年第11期88-90,共3页
发现最大频繁项目集是多种数据挖掘应用中的重要问题 .在应用中用户需要调整最小支持度 ,以发现更有用的最大频繁项目集 .为此提出了一种最大频繁项目集更新算法 (UMFPA) ,该算法通过对频繁模式树(FP Tree)中的频繁项目头表 (HTable)增... 发现最大频繁项目集是多种数据挖掘应用中的重要问题 .在应用中用户需要调整最小支持度 ,以发现更有用的最大频繁项目集 .为此提出了一种最大频繁项目集更新算法 (UMFPA) ,该算法通过对频繁模式树(FP Tree)中的频繁项目头表 (HTable)增加两个域 ,从而将减少在数据库不变而最小支持度变化的情况下的更新挖掘最大频繁项目集的费用 .实验结果表明 。 展开更多
关键词 数据挖掘 关联规则 最大频繁项目集 频繁模式树 最小支持度
下载PDF
一种改进的基于FP-Tree的高效挖掘最大频繁项目集算法 被引量:8
20
作者 尹治华 张大鹏 +1 位作者 谭明 王新生 《济南大学学报(自然科学版)》 北大核心 2017年第2期111-117,共7页
为了解决最大频繁项目集算法DMFIA(discover maximum frequent itemsets algorithm)在挖掘候选项目集维数较大而最大频繁项目集维数较小的情况下产生大量候选项目集的问题,提出一种改进的基于FP-Tree(frequent pattern tree)的最大频繁... 为了解决最大频繁项目集算法DMFIA(discover maximum frequent itemsets algorithm)在挖掘候选项目集维数较大而最大频繁项目集维数较小的情况下产生大量候选项目集的问题,提出一种改进的基于FP-Tree(frequent pattern tree)的最大频繁项目集挖掘的FP-EMFIA算法;该算法在挖掘过程中根据项目头表,采用自上而下和自下而上的双向搜索策略,并通过条件模式基中的频繁项目和较小维数的非频繁项目集对候选项目集进行降维和剪枝,以减少候选项目集的数量,加速对候选集计数的操作。在经典数据集mushroom、chess和connect上的实验结果表明,FP-EMFIA算法在支持度较小时的时间效率优于DMFIA、IDMFIA(improved algorithm of DMFIA)和BDRFI(algorithm for mining frequent itemsets based on decreasing dimensionality reduction of frequent itemsets)算法的,说明FP-EMFIA算法在候选项目集维数较大时有相对优势。 展开更多
关键词 数据挖掘 关联规则 最大频繁项目集 频繁模式树 非频繁项目集
下载PDF
上一页 1 2 45 下一页 到第
使用帮助 返回顶部