DAG-Based Data Generation with Functional Dependency Consistency

Original title: 基于有向无环图的函数依赖一致性数据生成

Abstract: To address the problem of generating data that is consistent with functional dependencies (FDs), a directed acyclic graph (DAG) is used to model the FD set. An algorithm for tuple generation with a single FD (TGSFD) is proposed to generate data consistent with one FD, and an attribute sorting algorithm is proposed to handle data generation under multiple FDs. To exploit pipelining for more efficient data generation, the concept of a minimal independent attribute subset (MIAS) is introduced and an attribute set partitioning algorithm is given. Experimental results show that TGSFD and the attribute sorting algorithm guarantee the FD consistency of the generated data, while attribute set partitioning and pipelining effectively improve generation efficiency.
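The abstract names its building blocks without showing them, so the following Python sketch illustrates the general idea under stated assumptions. It is not the paper's TGSFD or partitioning algorithm. The schema, the FDs, and every function name (`attribute_order`, `generate`, `satisfies`, `independent_subsets`) are hypothetical. The sketch assumes each attribute is the right-hand side of at most one FD, and it reads "independent attribute subsets" as the connected components of the FD graph, which is only one plausible interpretation.

```python
import random
from collections import defaultdict
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical schema and FDs, chosen only for illustration.
ATTRIBUTES = ["zip", "city", "state", "name", "dept", "manager"]
FDS = [(("zip",), "city"), (("city",), "state"), (("dept",), "manager")]


def attribute_order(attributes, fds):
    """Sort attributes so that every determinant precedes its dependent."""
    graph = {a: set() for a in attributes}
    for lhs, rhs in fds:
        graph[rhs].update(lhs)  # rhs can only be generated after lhs
    # Raises graphlib.CycleError if the FD graph is not a DAG.
    return list(TopologicalSorter(graph).static_order())


def generate(n, attributes, fds, domain_size=5, seed=0):
    """Generate n rows; dependent values are memoized per determinant value."""
    rng = random.Random(seed)
    rhs_to_lhs = {rhs: lhs for lhs, rhs in fds}  # assumes one FD per rhs
    order = attribute_order(attributes, fds)
    memo = {}  # (rhs attribute, lhs values) -> rhs value already assigned
    rows = []
    for _ in range(n):
        row = {}
        for attr in order:
            lhs = rhs_to_lhs.get(attr)
            if lhs is None:
                row[attr] = rng.randrange(domain_size)  # free attribute
            else:
                key = (attr, tuple(row[a] for a in lhs))
                if key not in memo:
                    memo[key] = rng.randrange(domain_size)
                row[attr] = memo[key]  # reuse value to keep the FD consistent
        rows.append(row)
    return rows


def satisfies(rows, fds):
    """Check that equal determinant values always map to equal dependents."""
    for lhs, rhs in fds:
        seen = {}
        for row in rows:
            key = tuple(row[a] for a in lhs)
            if seen.setdefault(key, row[rhs]) != row[rhs]:
                return False
    return True


def independent_subsets(attributes, fds):
    """Group attributes into connected components of the FD graph (union-find)."""
    parent = {a: a for a in attributes}

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]  # path halving
            a = parent[a]
        return a

    for lhs, rhs in fds:
        for a in lhs:
            parent[find(a)] = find(rhs)
    groups = defaultdict(list)
    for a in attributes:
        groups[find(a)].append(a)
    return list(groups.values())


if __name__ == "__main__":
    rows = generate(200, ATTRIBUTES, FDS)
    print(attribute_order(ATTRIBUTES, FDS))
    print(satisfies(rows, FDS))
    print(independent_subsets(ATTRIBUTES, FDS))
```

Because no FD links attributes in different groups, each group returned by `independent_subsets` could be generated by its own worker or pipeline stage and the columns joined afterwards; the sketch does not implement that step.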

Authors: Tan Mingchao (谭明超), Diao Xingchun (刁兴春), Cao Jianjun (曹建军), Feng Jing (冯径)

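Affiliations: College of Command Information Systems, PLA University of Science and Technology; 总参第; College of Meteorology and Oceanography, PLA University of Science and Technology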

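Source: Transactions of Beijing Institute of Technology (《北京理工大学学报》), 2014, No. 6, pp. 592-596 (5 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.

Funding: National Natural Science Foundation of China (61070714); Pre-research Fund of PLA University of Science and Technology (20110604); China Postdoctoral Science Foundation Special Grant (201003797); China Postdoctoral Science Foundation (20090461425)

Keywords: data generation; consistency; functional dependency; directed acyclic graph; pipeline

Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]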
