期刊文献+
共找到420篇文章
< 1 2 21 >
每页显示 20 50 100
Database Encoding and A New Algorithm for Association Rules Mining
1
作者 Tong Wang Pilian He 《通讯和计算机(中英文版)》 2006年第3期77-81,共5页
下载PDF
A New Hybrid Algorithm for Association Rule Mining 被引量:1
2
作者 张敏聪 燕存良 朱开玉 《Journal of Donghua University(English Edition)》 EI CAS 2007年第5期598-603,共6页
HA(hashing array),a new algorithm,for mining frequent itemsets of large database is proposed.It employs a structure hash array,ItemArray() to store the information of database and then uses it instead of database in l... HA(hashing array),a new algorithm,for mining frequent itemsets of large database is proposed.It employs a structure hash array,ItemArray() to store the information of database and then uses it instead of database in later iteration.By this improvement,only twice scanning of the whole database is necessary,thereby the computational cost can be reduced significantly.To overcome the performance bottleneck of frequent 2-itemsets mining,a modified algorithm of HA,DHA(direct-addressing hashing and array) is proposed,which combines HA with direct-addressing hashing technique.The new hybrid algorithm,DHA,not only overcomes the performance bottleneck but also inherits the advantages of HA.Extensive simulations are conducted in this paper to evaluate the performance of the proposed new algorithm,and the results prove the new algorithm is more efficient and reasonable. 展开更多
关键词 数据挖掘 散列法 数据库 混合算法 联合规则挖掘
下载PDF
基于网络环境的分布式KDD及Data Mining研究 被引量:6
3
作者 何炎祥 彭锋 +2 位作者 宋文欣 熊汉卫 陈莘萌 《小型微型计算机系统》 CSCD 北大核心 1999年第10期744-746,共3页
本文针对KDD 的研究现状及其面临的挑战,主要讨论了基于网络环境下,面向多个站点机、多种数据库、多类数据源的分布式KDD 和Data Mining 的整体方案和实验系统模型,研究内容包括高效分布式开采算法,KDD 过程的... 本文针对KDD 的研究现状及其面临的挑战,主要讨论了基于网络环境下,面向多个站点机、多种数据库、多类数据源的分布式KDD 和Data Mining 的整体方案和实验系统模型,研究内容包括高效分布式开采算法,KDD 过程的无缝集成,KDD 中的知识表示。 展开更多
关键词 知识发现 数据开采 知识表示 可视化 数据库系统
下载PDF
Designing a Model to Study Data Mining in Distributed Environment
4
作者 Md. Abadur Rahman Masud Karim 《Journal of Data Analysis and Information Processing》 2021年第1期23-29,共7页
To make business policy, market analysis, corporate decision, fraud detection, etc., we have to analyze and work with huge amount of data. Generally, such data are taken from different sources. Researchers are using d... To make business policy, market analysis, corporate decision, fraud detection, etc., we have to analyze and work with huge amount of data. Generally, such data are taken from different sources. Researchers are using data mining to perform such tasks. Data mining techniques are used to find hidden information from large data source. Data mining is using for various fields: Artificial intelligence, Bank, health and medical, corruption, legal issues, corporate business, marketing, etc. Special interest is given to associate rules, data mining algorithms, decision tree and distributed approach. Data is becoming larger and spreading geographically. So it is difficult to find better result from only a central data source. For knowledge discovery, we have to work with distributed database. On the other hand, security and privacy considerations are also another factor for de-motivation of working with centralized data. For this reason, distributed database is essential for future processing. In this paper, we have proposed a framework to study data mining in distributed environment. The paper presents a framework to bring out actionable knowledge. We have shown some level by which we can generate actionable knowledge. Possible tools and technique for these levels are discussed. 展开更多
关键词 data mining Distributed database knowledge discovery Classification Algorithm
下载PDF
Elicitation of Association Rules from Information on Customs Offences on the Basis of Frequent Motives
5
作者 Bi Bolou Zehero Etienne Soro +2 位作者 Yake Gondo Pacome Brou Olivier Asseu 《Engineering(科研)》 2018年第9期588-605,共18页
The fight against fraud and trafficking is a fundamental mission of customs. The conditions for carrying out this mission depend both on the evolution of economic issues and on the behaviour of the actors in charge of... The fight against fraud and trafficking is a fundamental mission of customs. The conditions for carrying out this mission depend both on the evolution of economic issues and on the behaviour of the actors in charge of its implementation. As part of the customs clearance process, customs are nowadays confronted with an increasing volume of goods in connection with the development of international trade. Automated risk management is therefore required to limit intrusive control. In this article, we propose an unsupervised classification method to extract knowledge rules from a database of customs offences in order to identify abnormal behaviour resulting from customs control. The idea is to apply the Apriori principle on the basis of frequent grounds on a database relating to customs offences in customs procedures to uncover potential rules of association between a customs operation and an offence for the purpose of extracting knowledge governing the occurrence of fraud. This mass of often heterogeneous and complex data thus generates new needs that knowledge extraction methods must be able to meet. The assessment of infringements inevitably requires a proper identification of the risks. It is an original approach based on data mining or data mining to build association rules in two steps: first, search for frequent patterns (support >= minimum support) then from the frequent patterns, produce association rules (Trust >= Minimum Trust). The simulations carried out highlighted three main association rules: forecasting rules, targeting rules and neutral rules with the introduction of a third indicator of rule relevance which is the Lift measure. Confidence in the first two rules has been set at least 50%. 展开更多
关键词 data mining Customs Offences Unsupervised Method Principle of Apriori Frequent Motive rule of association Extraction of knowledge
下载PDF
Spatial data mining and visualization based on self-organizing map
6
作者 LIU Shu-ying OUYANG Hong-ji PENG Fang 《通讯和计算机(中英文版)》 2008年第12期55-60,共6页
关键词 空间数据分析 数据挖掘 可视化系统 分析方法
下载PDF
Research on Employment Data Mining for Higher Vocational Graduates
7
作者 Feng Lin 《International Journal of Technology Management》 2014年第7期78-80,共3页
关键词 数据挖掘技术 就业指导 毕业生 高职 APRIORI算法 数据预处理方法 关联规则 管理决策
下载PDF
An Overview of Data Mining and Knowledge Discovery 被引量:8
8
作者 范建华 李德毅 《Journal of Computer Science & Technology》 SCIE EI CSCD 1998年第4期348-368,共21页
With massive amounts of data stored in databases, mining information and knowledge in databases has become an important issue in recent research. Researchers in many different fields have shown great interest in data ... With massive amounts of data stored in databases, mining information and knowledge in databases has become an important issue in recent research. Researchers in many different fields have shown great interest in data mining and knowledge discovery in databases. Several emerging applications in information providing services, such as data warehousing and on-line services over the Internet, also call for various data mining and knowledge discovery techniques to understand user behavior better, to improve the service provided, and to increase the business opportunities. In response to such a demand, this article is to provide a comprehensive survey on the data mining and knowledge discovery techniques developed recently, and introduce some real application systems as well. In conclusion, this article also lists some problems and challenges for further research. 展开更多
关键词 knowledge discovery in databases data mining machine learning association rule CLASSIFICATION data clustering data generalization pattern searching
原文传递
A Fast Algorithm for Mining Association Rules 被引量:17
9
作者 黄刘生 陈华平 +1 位作者 王洵 陈国良 《Journal of Computer Science & Technology》 SCIE EI CSCD 2000年第6期619-624,共6页
In this paper, the problem of discovering association rules between items in a large database of sales transactions is discussed, and a novel algorithm, BitMatrix, is proposed. The proposed algorithm is fundamentally ... In this paper, the problem of discovering association rules between items in a large database of sales transactions is discussed, and a novel algorithm, BitMatrix, is proposed. The proposed algorithm is fundamentally different from the known algorithms Apriori and AprioriTid. Empirical evaluation shows that the algorithm outperforms the known ones for large databases. Scale-up experiments show that the algorithm scales linearly with the number of transactions. 展开更多
关键词 database data mining large itemset association rule minimum support minimum confidence
原文传递
An improved association-mining research for exploring Chinese herbal property theory: based on data of the Shennong's Classic of Materia Medica 被引量:15
10
作者 Rui Jin Zhi-jian Lin +1 位作者 Chun-miao Xue Bing Zhang 《Journal of Integrative Medicine》 SCIE CAS CSCD 2013年第5期352-365,共14页
Knowledge Discovery in Databases is gaining attention and raising new hopes for traditional Chinese medicine (TCM) researchers. It is a useful tool in understanding and deciphering TCM theories. Aiming for a better ... Knowledge Discovery in Databases is gaining attention and raising new hopes for traditional Chinese medicine (TCM) researchers. It is a useful tool in understanding and deciphering TCM theories. Aiming for a better understanding of Chinese herbal property theory (CHPT), this paper performed an improved association rule learning to analyze semistructured text in the book entitled Shennong's Classic of Materia Medica. The text was firstly annotated and transformed to well-structured multidimensional data. Subsequently, an Apriori algorithm was employed for producing association rules after the sensitivity analysis of parameters. From the confirmed 120 resulting rules that described the intrinsic relationships between herbal property (qi, flavor and their combinations) and herbal efficacy, two novel fundamental principles underlying CHPT were acquired and further elucidated: (1) the many-to-one mapping of herbal efficacy to herbal property; (2) the nonrandom overlap between the related efficacy of qi and flavor. This work provided an innovative knowledge about CHPT, which would be helpful for its modern research. 展开更多
关键词 traditional Chinese medicine Chinese herbal property theory association rulelearning knowledge discovery data mining
下载PDF
A New Approach for Knowledge Discovery in Distributed Databases Using Fragmented Data Storage Model
11
作者 Masoud Pesaran Behbahani Islam Choudhury Souheil Khaddaj 《Chinese Business Review》 2013年第12期834-845,共12页
关键词 数据存储模型 分布式数据库 知识发现 决策支持系统 智能化管理 分散 决策支持模型 智能方法
下载PDF
An Experimental Analysis of the Applications of Datamining Methods on Bigdata
12
作者 CH.Naga Santhosh Kumar K.S.Reddy 《Journal of Autonomous Intelligence》 2019年第3期31-39,共9页
Data mining is a procedure of separating covered up,obscure,however possibly valuable data from gigantic data.Huge Data impactsly affects logical disclosures and worth creation.Data mining(DM)with Big Data has been br... Data mining is a procedure of separating covered up,obscure,however possibly valuable data from gigantic data.Huge Data impactsly affects logical disclosures and worth creation.Data mining(DM)with Big Data has been broadly utilized in the lifecycle of electronic items that range from the structure and generation stages to the administration organize.A far reaching examination of DM with Big Data and a survey of its application in the phases of its lifecycle won't just profit scientists to create solid research.As of late huge data have turned into a trendy expression,which constrained the analysts to extend the current data mining methods to adapt to the advanced idea of data and to grow new scientific procedures.In this paper,we build up an exact assessment technique dependent on the standard of Design of Experiment.We apply this technique to assess data mining instruments and AI calculations towards structure huge data examination for media transmission checking data.Two contextual investigations are directed to give bits of knowledge of relations between the necessities of data examination and the decision of an instrument or calculation with regards to data investigation work processes. 展开更多
关键词 data mining Big data knowledge discovery databases Decision Tree Cloud data mining K-Closest Neighbor Artificial intelligence CLUSTER
下载PDF
Generalized Multidimensional Association Rules
13
作者 周傲英 周水庚 +1 位作者 金文 田增平 《Journal of Computer Science & Technology》 SCIE EI CSCD 2000年第4期388-392,共5页
The problem of association rule mining has gained considerableprominence in the data mining community for its use as an important tool of knowledge discovery from large-scale databases. And there has been a spurt of r... The problem of association rule mining has gained considerableprominence in the data mining community for its use as an important tool of knowledge discovery from large-scale databases. And there has been a spurt of researchactivities around this problem. Traditional association rule mining is limited tointratransaction. Only recently the concept of N-dimensional inter-transaction association rule (NDITAR) was proposed by H.J. Lu. This paper modifies and extendsLu's definition of NDITAR based on the analysis of its limitations, and the generalized multidimensional association rule (GMDAR) is subsequently introduced, whichis more general, flexible and reasonable than NDITAR. 展开更多
关键词 multidimensional transaction database data mining Ndimensionalinter-transaction association rules (NDITAR) generalized multidimensional association rules (GMDAR)
原文传递
Integrated Solution for Discovery of Literature Information Knowledge
14
作者 徐慧 《International Journal of Mining Science and Technology》 SCIE EI 2000年第2期91-94,共4页
An integrated solution for discovery of literature information knowledge is proposed. The analytic model of literature Information model and discovery of literature information knowledge are illustrated. Practical ill... An integrated solution for discovery of literature information knowledge is proposed. The analytic model of literature Information model and discovery of literature information knowledge are illustrated. Practical illustrative example for discovery of literature information knowledge is given. 展开更多
关键词 knowledge discovery data mining data WAREHOUSE database Weh
下载PDF
Taxpayers Fraudulent Behavior Modeling The Use of Datamining in Fiscal Fraud Detecting Moroccan Case
15
作者 Farid Ameur Mohamed Tkiouat 《Applied Mathematics》 2012年第10期1207-1213,共7页
The fraudulent behavior of taxpayers impacts negatively the resources available to finance public services. It creates distortions of competition and inequality, harming honest taxpayers. Such behavior requires the go... The fraudulent behavior of taxpayers impacts negatively the resources available to finance public services. It creates distortions of competition and inequality, harming honest taxpayers. Such behavior requires the government intervention to bring order and establish a fiscal justice. This study emphasizes the determination of the interactions linking taxpayers with tax authorities. We try to see how fiscal audit can influence taxpayers’ fraudulent behavior. First of all, we present a theoretical study of a model pre established by other authors. We have released some conditions of this model and we have introduced a new parameter reflecting the efficiency of tax control;we found that the efficiency of a fiscal control have an important effect on these interactions. Basing on the fact that the detection of fraudulent taxpayers is the most difficult step in fiscal control, We established a new approach using DATA MINING process in order to improve fiscal control efficiency. We found results that reflect fairly the conduct of taxpayers that we have tested based on actual statistics. The results are reliable. 展开更多
关键词 TAX FRAUD TAX EVASION data mining knowledge discovery in databases (KDD) FISCAL Policy FISCAL Reform FISCAL Control FISCAL Justice TAXPAYERS TAX Administration
下载PDF
Mining φ-Frequent Itemset Using FP-Tree
16
作者 李天瑞 《Journal of Modern Transportation》 2001年第1期67-74,共8页
The problem of association rule mining has gained considerable prominence in the data mining community for its use as an important tool of knowledge discovery from large scale databases. And there has been a spurt of... The problem of association rule mining has gained considerable prominence in the data mining community for its use as an important tool of knowledge discovery from large scale databases. And there has been a spurt of research activities around this problem. However, traditional association rule mining may often derive many rules in which people are uninterested. This paper reports a generalization of association rule mining called φ association rule mining. It allows people to have different interests on different itemsets that arethe need of real application. Also, it can help to derive interesting rules and substantially reduce the amount of rules. An algorithm based on FP tree for mining φ frequent itemset is presented. It is shown by experiments that the proposed methodis efficient and scalable over large databases. 展开更多
关键词 data processing databaseS φ association rule mining φ frequent itemset FP tree data mining
下载PDF
Human Body Information Database and Method of Application
17
作者 ZHANG Li LI Yanmei 《International English Education Research》 2016年第7期112-114,共3页
下载PDF
An efficient algorithm for mining closed itemsets 被引量:1
18
作者 刘君强 潘云鹤 《Journal of Zhejiang University Science》 CSCD 2004年第1期8-15,共8页
This paper presents a new efficient algorithm for mining frequent closed itemsets. It enumerates the closed set of frequent itemsets by using a novel compound frequent itemset tree that facilitates fast growth and eff... This paper presents a new efficient algorithm for mining frequent closed itemsets. It enumerates the closed set of frequent itemsets by using a novel compound frequent itemset tree that facilitates fast growth and efficient pruning of search space. It also employs a hybrid approach that adapts search strategies, representations of projected transaction subsets, and projecting methods to the characteristics of the dataset. Efficient local pruning, global subsumption checking, and fast hashing methods are detailed in this paper. The principle that balances the overheads of search space growth and pruning is also discussed. Extensive experimental evaluations on real world and artificial datasets showed that our algorithm outperforms CHARM by a factor of five and is one to three orders of magnitude more efficient than CLOSET and MAFIA. 展开更多
关键词 散列法 知识发现 全局包含检验 搜索策略
下载PDF
Structural choice based on knowledge discovery system
19
作者 XING Fangliang(邢方亮) +1 位作者 WANG Guangyuan(王光远) 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2002年第3期263-266,共4页
Structural choice is a significant decision having an important influence on structural function, social economics, structural reliability and construction cost. A Case Based Reasoning system with its retrieval part c... Structural choice is a significant decision having an important influence on structural function, social economics, structural reliability and construction cost. A Case Based Reasoning system with its retrieval part constructed with a KDD subsystem, is put forward to make a decision for a large scale engineering project. A typical CBR system consists of four parts: case representation, case retriever, evaluation, and adaptation. A case library is a set of parameterized excellent and successful structures. For a structural choice, the key point is that the system must be able to detect the pattern classes hidden in the case library and classify the input parameters into classes properly. That is done by using the KDD Data Mining algorithm based on Self Organizing Feature Maps (SOFM), which makes the whole system more adaptive, self organizing, self learning and open. 展开更多
关键词 knowledge discovery in database data mining SELF-ORGANIZinG feature MAPS STRUCTURAL CHOICE
下载PDF
Method and Application of Comprehensive Knowledge Discovery
20
作者 SHAZongyao BIANFuling 《Geo-Spatial Information Science》 2003年第3期48-55,共8页
This paper proposes the principle of comprehensive knowledge discovery. Unlike most of the current knowledge discovery methods, the comprehensive knowledge discovery considers both the spatial relations and attributes... This paper proposes the principle of comprehensive knowledge discovery. Unlike most of the current knowledge discovery methods, the comprehensive knowledge discovery considers both the spatial relations and attributes of spatial entities or objects. We introduce the theory of spatial knowledge expression system and some concepts including comprehensive knowledge discovery and spatial union information table (SUIT). In theory, SUIT records all information contained in the studied objects, but in reality, because of the complexity and varieties of spatial relations, only those factors of interest to us are selected. In order to find out the comprehensive knowledge from spatial databases, an efficient comprehensive knowledge discovery algorithm called recycled algorithm (RAR) is suggested. 展开更多
关键词 综合知识发现 知识发现算法 空间结合规则 知识表达系统 数字采矿
下载PDF
上一页 1 2 21 下一页 到第
使用帮助 返回顶部