期刊文献+

基于有向无环图的函数依赖一致性数据生成

DAG Based Data Generation with Functional Dependencies Consistency
下载PDF
导出
摘要 针对函数依赖一致性数据生成问题,采用有向无环图作为函数依赖集合的描述模型,提出一种单函数依赖一致性数据生成算法(TGSFD);并通过属性排序解决多函数依赖一致性数据生成问题;为了利用流水线技术提高数据生成效率,提出最小独立属性子集概念,并给出了属性集划分算法.实验表明本文提出的TGSFD和属性排序算法能够保证生成的数据满足函数依赖一致性,属性集划分和流水线技术可以有效提高数据生成效率. For data generation problems with functional dependency(FD) consistency,directed acyclic graph(DAG) was used to model FDs set,an algorithm of tuple generation with single FD(TGSFD) was proposed to generate data consistent with single FD,an attributes sorting algorithm was proposed to solve the data generation problems with multi FDs.In order to utilize pipelining technique to improve the efficiency of data generation,a concept of minimal independent attributes subset(MIAS) was proposed and the attributes set partitioning algorithm was given.Experiments results show that TGSFD and attributes sorting algorithm can guarantee the FD consistency of generated data,while MIAS and pipeline technique can improve the efficiency of data generation.
出处 《北京理工大学学报》 EI CAS CSCD 北大核心 2014年第6期592-596,共5页 Transactions of Beijing Institute of Technology
基金 国家自然科学基金资助项目(61070714) 解放军理工大学预研基金资助项目(20110604) 中国博士后科学基金特别资助项目(201003797) 中国博士后科学基金资助项目(20090461425)
关键词 数据生成 一致性 函数依赖 有向无环图 流水线 data generation consistency functional dependency directed acyclic graph pipeline
  • 相关文献

参考文献8

  • 1Alexander K.Generate test data using SQL.IBM developerWorks,2004.http://www.IBM.com/developerworks/data/library/techarticle/dm-0405kuzne-tsov.
  • 2Cong Gao,Fan Wenfei,Floris Geerts,et al.Improving data quality:consistency and accuracy//Proceedings of VLDB\'07.Vienna,Austria:VLDB Endowment,2007:315-326.
  • 3Fan Wenfei,Greets F,Jia Xibei.Conditional functional dependencies for capturing data inconsistencies[J].ACM Transactions on Database Systems,2008,33(2):1-48.
  • 4Houkj?r K,Torp K,Wind R.Simple and realistic data generation//Proceedings of VLDB\'06.Seoul,Korea:VLDB Endowment,2006:1243-1246.
  • 5Talburt J R,Zhou Yinle,Savitha Y S.SOG:a synthetic occupancy generator to support entity resolution//Proceedings of ICIQ\'09.Potsdam,Germany:Hasso Plattner Institute,2009:91-105.
  • 6Chays D.Test data generation for relational database applications.New York:Polytechnic University,2005.
  • 7Date C J.An introduction to database systems[M].7th ed.Boston:Addison-Wesley,2007:331-339.
  • 8Transaction Processing Performance Council(TPC).TPC-C V5.11-2010[S].San Francisco,USA:Transaction Processing Performance Council,2010.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部