期刊文献+

一种新的基于回归分析的异常值检测 被引量:6

Outliers Detection of a New Method Based on Regression Analysis
原文传递
导出
摘要 异常值检测是当前数据分析中的一个重要研究领域.模型中的异常值会直接影响建模、参数的估计、预测等问题.回归分析是应用极其广泛的数据分析方法之一,本文针对回归分析中的异常值检测进行了研究.该方法基于均值转移模型,根据异常值对残差平方和的影响关系构造一个新的异常值判断准则的统计量,并给出了估计异常值大小的公式.本文进行了大量的模拟实验和实例分析,与传统方法相比,结果表明该方法是有效的. Outliers detection is an important research field in the current data analysis.Outliers in the data will affect the modeling,estimating parameters,forecasting and other issues directly.Regression analysis is one of the most widespread application of data analysis methods.We suggest a new method for outliers detection based on regression analysis in this article.Mean-shift model is considered in this paper.A new statistic method detecting outliers according to the relationship between sum of squares for error(SSE)and outliers is proposed.Moreover,the method of estimating the size of those outliers is given.The approach is more effective to outliers compared with traditional methods,which is illustrated with a large number of simulation studies as well as real data.
出处 《河南大学学报(自然科学版)》 CAS 2015年第6期635-639,共5页 Journal of Henan University:Natural Science
基金 国家自然科学基金数学天元基金(11426159) 北京市教育委员会社会科学研究计划项目(SM201310038007)
关键词 回归分析 异常值 最小二乘估计(OLS) 残差平方和(SSE) regression analysis outlier ordinary least square sum of squares for error
  • 相关文献

参考文献12

  • 1Becker C, Gather U. The masking breakdown point of multivariate outliers [J]. J Amer Assoc, 1999, 94(447) : 947--955.
  • 2Lawrence A J. Deletion influence and mashing in regression [J]. Journal of the RoyalStati tical Society, Series B(Methodo~ logical), 1995,57(1) :181--189.
  • 3Andrews D F. A robust method for multiple linear regression [J]. Technometrics, 1974(16): 523--531.
  • 4Mosteller F, Tukey J W. Data analysis and regression., a second course in statistics [M]. Reading.. MA:Addison-Wesley, 1977.
  • 5Devlin S J, Gnanadesikan R, Kettenring J. Robust estimation and outliers detection with correlation coefficients [J]. Bio metrica, 1975 (62).. 531--546.
  • 6Krasker W S, Welsch R E. Efficient bounded influence regression estimation [J]. Journal of American Statistical Associa tion, 1982 (77): 595--604.
  • 7陈希孺,王松佳.近代线性回归-原理方法及应用[M].合肥:安微、教育出版社,1987.
  • 8Draper N R, Smith H. Applied regression analysis[M]. New York: John Wiley, 1981.
  • 9彭珊线.线性回归模型中关于异常点的若干问题的分析[D].黑龙江:东北林业大学,2014.
  • 10Hoeting J, Raftery A E, Madigan D. A method for simultaneousvariable selection and outlier identification in linear re gression [J]. Comput Stat Data Anal, 1996(22) : 252--270.

共引文献1

同被引文献31

引证文献6

二级引证文献36

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部