期刊文献+

一种新颖的基于密度的祛噪声方法 被引量:1

A Novel Algorithm for Outlier Removal Based on Density
下载PDF
导出
摘要 由于采集技术和设备的局限,以及外界的各种干扰,采集得到的数据中常常参杂着噪声,直接影响着后续数据分析的结果.传统的祛噪声方法,或是依赖于数据服从某一特定分布的假设,或是只能对服从单一分布的数据进行祛噪声处理,这些固有的缺陷大大降低了处理后数据的可信度.本文提出了一种新颖的基于密度的祛噪声方法,并应用在实际交通数据的处理中.通过与传统方法的实验比较,结果表明该方法摒除了传统方法的缺陷,能够对服从多个相异分布的数据进行有效的祛噪声处理,且处理后的数据能够很好地保留系统本质的特征. Due to the limitation of the present techniques and facilities for data collection and various interferences, the data obtained are often distorted and noised, directly influencing the result of subsequent data analysis. The conventional approaches to outlier removal either assume that the data follow a certain known distribution or deal with the data that are from a single distribution, resulting in a reduced credibility of the data processed. This paper proposes a novel method to remove outliers based on density estimation and it has been applied to real-world traffic data. By comparison with the conventional approach, the experimental results indicate that the proposed algorithm is capable of detecting and removing outliers effectively for the data that may follow different unknown distributions, and the processed data retain the original and significant characteristics possessed by the system.
作者 王扬
出处 《自动化学报》 EI CSCD 北大核心 2010年第2期343-346,共4页 Acta Automatica Sinica
关键词 祛噪声 密度估算 期望值 噪声识别 行驶速度 Outlier removal density estimation expectation outlier detection travel speed
  • 相关文献

参考文献10

  • 1Taylor J R. An Introduction to Error Analysis: The Study of Uncertainties in Physical Measurements (Second Edition). Sausolito: University Science Books, 1997. 120-187.
  • 2Bevington P R, Robinson D K. Data Reduction and Error Analysis for the Physical Sciences (Third Edition). New York: McGraw Hill, 2002.
  • 3Rousseeuw P J, Ruts I, Tukey J W. The bagplot: a bivariate boxplot. The American Statistician, 1999, 53(4): 382-387.
  • 4Fornasini P. The Uncertainty in Physical Measurements: An Introduction to Data Analysis in the Physics Laboratory. New York: Springer, 2008.
  • 5He Z Y, Xu X F, Deng S C. Discovering cluster-based local outliers. Pattern Recognition Letters, 2003, 24(9-10): 1641-1650.
  • 6Fu L, Medico E. FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data. BMC Bioinformatics, 2007, 8(3): 1-15.
  • 7Ferrer-i-Cancho R. The Euclidean distance between syntactically linked words. Physical Review E, 2004, 70(5): 1-5.
  • 8Krause E F. Taxicab Geometry. New York: Dover, 1987. 63-89.
  • 9Huang W, Shi Y Y, Zhang S Y, Zhu Y F. The communication complexity of the Hamming distance problem. Information Processing Letters, 2006, 99(4): 149-153.
  • 10Li Y J, Liu B. A normalized Levenshtein distance metric. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 29(6): 1091-1095.

同被引文献13

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部