期刊文献+

基于均值漂移模型的异常值检测方法 被引量:1

Outlier Detection Method Based on the Mean Shift Model
下载PDF
导出
摘要 由于异常值的存在对统计推断有很大影响,因此异常值检测是数据分析中的一个重要步骤。对于横截面数据的线性模型,改写模型的设计矩阵后,基于均值漂移模型,利用系数压缩估计方法来进行异常值检测。由于系数压缩估计中调节参数的选择对检测效果有很大影响,基于两种调节方法的加权,提出了一种新的调节方法。数值模拟结果表明,使用这种基于均值漂移模型的异常值检测调节方法,可以显著降低犯两种错误的概率。 Outlier detection is an important part for data analysis, since outliers can infect the statistical inference evidently. The design matrix in a linear model for cross sectional data was rewritten, and an outlier detection method was proposed based on the mean shift model by using the coefficient shrink estimation. Because the selection of tuning model parameters is very important for outlier detection, a new tuning method based on a weighted tuning process was proposed. The numerical simulation results show that when the new tuning method is applied in the outlier detection procedure, two false identification probability can be decreased observably.
作者 张探探 樊亚莉 钟先乐 ZHANG Tantan;FAN Yali;ZHONG Xianle(College of Science, University of Shanghai for Science and Technology, Shanghai 200093, Chin)
出处 《上海理工大学学报》 CAS 北大核心 2018年第2期116-120,共5页 Journal of University of Shanghai For Science and Technology
基金 国家自然科学基金资助项目(11401383)
关键词 异常值检测 参数调节 均值漂移模型 系数压缩估计 outlier detection parameter tuning mean shift model coefficient shrink estimation
  • 相关文献

参考文献1

二级参考文献12

  • 1陈希孺,王松佳.近代线性回归-原理方法及应用[M].合肥:安微、教育出版社,1987.
  • 2Becker C, Gather U. The masking breakdown point of multivariate outliers [J]. J Amer Assoc, 1999, 94(447) : 947--955.
  • 3Lawrence A J. Deletion influence and mashing in regression [J]. Journal of the RoyalStati tical Society, Series B(Methodo~ logical), 1995,57(1) :181--189.
  • 4Andrews D F. A robust method for multiple linear regression [J]. Technometrics, 1974(16): 523--531.
  • 5Mosteller F, Tukey J W. Data analysis and regression., a second course in statistics [M]. Reading.. MA:Addison-Wesley, 1977.
  • 6Devlin S J, Gnanadesikan R, Kettenring J. Robust estimation and outliers detection with correlation coefficients [J]. Bio metrica, 1975 (62).. 531--546.
  • 7Krasker W S, Welsch R E. Efficient bounded influence regression estimation [J]. Journal of American Statistical Associa tion, 1982 (77): 595--604.
  • 8Draper N R, Smith H. Applied regression analysis[M]. New York: John Wiley, 1981.
  • 9彭珊线.线性回归模型中关于异常点的若干问题的分析[D].黑龙江:东北林业大学,2014.
  • 10Hoeting J, Raftery A E, Madigan D. A method for simultaneousvariable selection and outlier identification in linear re gression [J]. Comput Stat Data Anal, 1996(22) : 252--270.

共引文献5

同被引文献2

引证文献1

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部