
Not-temporal attribute correlation model to generate table data realistically
Abstract To address the difficulty of correlating attributes when simulating table data, an H model is proposed that describes the correlation among non-temporal attributes in table data. First, the key attributes of the evaluation subject and the evaluated subject are extracted from the data set, and twofold frequency statistics yield four relationship pairs over these key attributes. Next, the Maximum Information Coefficient (MIC) of each relationship pair is calculated to assess its correlation, and each pair is fitted with the Stretched Exponential (SE) distribution. Finally, the data scales of the evaluation subject and the evaluated subject are set, the activity of the evaluation subject and the popularity of the evaluated subject are calculated from the fitted relationships, and the two are linked by requiring that the total activity equal the total popularity, which gives the H model of non-temporal attribute correlation. Experimental results show that the H model effectively describes the correlation characteristics among non-temporal attributes in real data sets.
Source Journal of Computer Applications (《计算机应用》), CSCD, Peking University Core (北大核心), 2017, No. 9, pp. 2684-2688 (5 pages)
Funding Major Project of the Fujian Provincial Science and Technology Program (2016H6007); Fuzhou City-University Cooperation Project (2016-G-40)
Keywords data simulation; correlation; Maximum Information Coefficient (MIC); Stretched Exponential (SE) distribution; attribute correlation
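
The abstract does not spell out how the twofold frequency statistics are computed, so the following is only a minimal sketch of one plausible reading: count how often each evaluation subject and each evaluated subject appears, then count how many subjects share each count. The `records` data, the rater/item naming, and the `twofold_frequency` helper are hypothetical illustrations, not taken from the paper. The sketch also checks the constraint the abstract relies on, namely that total activity equals total popularity.

```python
from collections import Counter

# Hypothetical (evaluation subject, evaluated subject) records; not data from the paper.
records = [
    ("u1", "a"), ("u1", "b"), ("u1", "c"),
    ("u2", "a"), ("u2", "b"),
    ("u3", "a"),
]


def twofold_frequency(keys):
    """Return (count per key, number of keys sharing each count)."""
    first = Counter(keys)             # first pass: occurrences of each subject
    second = Counter(first.values())  # second pass: how many subjects have each count
    return first, second


# Activity: number of records produced by each evaluation subject.
activity, activity_hist = twofold_frequency(rater for rater, _ in records)
# Popularity: number of records received by each evaluated subject.
popularity, popularity_hist = twofold_frequency(item for _, item in records)

# Every record adds one to exactly one rater and one item, so the totals agree.
assert sum(activity.values()) == sum(popularity.values()) == len(records)

print(sorted(activity_hist.items()))
print(sorted(popularity_hist.items()))
```

In a generator built along these lines, the second-level histograms would be the quantities fitted with the SE distribution, and the equal-sum check is what allows synthetic raters and items to be paired into records without leftovers. Both points are assumptions about how the abstract's steps fit together.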