期刊文献+

一种基于交替投影的脏数据处理方法

A new dirty data processing method based on alternative projection
下载PDF
导出
摘要 由于硬件、软件或传输故障等,用于流量矩阵估计的简单网络管理协议(Simple Network Mamagement Protocol,SNMP)数据可能包含脏数据,从而影响流量矩阵的精度.针对这个问题,提出一种基于SNMP的脏数据处理模型,摆脱了原有SNMP脏数据处理需要源-目的节点对间流量大规模测量的限制.基于交替投影方法,对此模型提出求得L0范数最小的稀疏脏数据处理方法.该算法降低了网络测量开销和时间复杂度,易于实现.实验表明,该算法对脏数据校正也有较高精度. Due to the hardware/software/transmission faults, the SNMP date used in traffic matrices estimating might contain dirty date and impact its accuracy. To solve this problem, a SNMP-self-based model, which was free for largely traffic measurement of Origin/Destination pairs in previous SNMP dirty data transaction, was pres- ented in this paper. Furthermore, based on alternative projection, a sparse dirty data processing algorithm was proposed to get L0 norm minimized Solution. It has lower measuring overhead and calculation complexity. Also, it is adaptive to applicationwidely. Besides, it can get high precision.
出处 《江苏科技大学学报(自然科学版)》 CAS 北大核心 2009年第6期527-530,共4页 Journal of Jiangsu University of Science and Technology:Natural Science Edition
基金 国家自然科学基金资助项目(60573063 60573064)
关键词 稀疏脏数据 交替投影 L0范数最小解 sparse dirty data ahernating projection L0 norm minimized solution
  • 相关文献

参考文献16

  • 1Zhao Q, Kumar A, Wang J, et al. Data streaming algorithms for accurate and efficient measurement of traffic and flow matrices[J]. ACM SIGMETRICS Performance Evaluation Review, 2005,33 ( 1 ) : 350 - 361.
  • 2Zhang Y, Roughan M, Lund C,et al. An information-theoretic approach to traffic matrix estimation [ C ]//In Proc ACM SIGCOMM. New York : ACM,2003:301 - 312.
  • 3Zhang Y, Roughan M, Duffield N, et al. Fast accurate computation of large-scale IP traffic matrices from link loads [ J ]. ACM S1GMETRICS Performance Evaluation Review,2003, 31 ( 1 ) :206 -217.
  • 4Vardi Y. Network tomography: estimating source-destination traffic intensities from link data[ J]. Journal of the American Statistical Association, 1996, 91 (433): 365 - 377.
  • 5Soule A, Lakhina A, Taft N, et al. Traffic matrices : balancing measurements, inference and modeling [ J ]. ACM SIGMETRICS Performance Evaluation Review, 2005, 33 ( 1 ) :362 - 373.
  • 6Papagiannaki K, Taft N, Lakhina A. A distributed approach to measure IP traffic matrices [ C ]//In Proc ACM IMC. New York:ACM, 2004:161 - 174.
  • 7Zhang Y, Ge Z H, Greeenberg A, et al. Network anomography[ C]//Proceedings of the 5th ACM SIGCOMM Conference on Internet Measurement. Berkeley:USENIX Association, 2005.
  • 8周静静,杨家海,杨扬,张辉.流量矩阵估算的研究[J].软件学报,2007,18(11):2669-2682. 被引量:16
  • 9Zhao Qi, Ge Zihui, Wang Jia, et al. Robust traffic matrix estimation with imperfect information: making use of multiple data sources [ J ]. ACM SIGMETRICS Performance Evaluation Review, 2006,34( 1 ) : 133 - 144.
  • 10Donoho D L. For most large undetermined systems of linear equations the minimal 11-norm solution is also the sparsest solution [ J ]. Pure Appl Math, 2006,59:797 - 829.

二级参考文献9

  • 1刘紫千,陈常嘉.基于流量矩阵估计的路由推断算法[J].铁道学报,2005,27(6):66-70. 被引量:3
  • 2[1]RAHM E,DO H H.Data cleaning:problems and current approaches[C]Proceedings of IEEE Data Engineering Bulletin.Germany:Leipzig University,2000.
  • 3[2]BRAHA D,SHMILOVICI A.Data mining for improving a cleaning process in the semiconductor industry[J].IEEE Tran Semicon duct Manufact.California:University of California at Berkeley,2002,5(2):91-101.
  • 4[3]ELOVICI Y,BRAHA D.A decision-theoretic approach to data mining.IEEE Transactions on Systems,Man and Cybernetics,2003,33(1):1-10.
  • 5[4]HERNANDEZ M,STOIFO S.The merge/purge problem for large databases[C]Proceedings of the ACM SIGMOD International Conference on Management of Data.New York:Columbia University,1995(5):127-138.
  • 6[5]MAURICIO A H.Real-World Data is Dirty:Data Cleansing and the merge/purge problems[J].Journal of Data Mining and Knowledge Discovery,1998,2:2-31.
  • 7[6]MONGE A E.ELKAN C P.An efficient domain-independent algorithm for detecting approximately duplicate database records[C]∥Proceedings of SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery.New York:Columbia University,2000.
  • 8[7]MONGE A.Matching algorithms within a duplicate detection system[C]∥Proceedings of IEEE Data Engineering Bulletin.California:California State University,2000.
  • 9赵国锋,王灵矫,唐红,程代杰.基于IP/MPLS网络的动态业务流量矩阵测量模型[J].通信学报,2003,24(10):145-152. 被引量:5

共引文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部