期刊文献+

Histogram-kernel Error and Its Application for Bin Width Selection in Histograms 被引量:1

Histogram-kernel Error and Its Application for Bin Width Selection in Histograms
原文传递
导出
摘要 Histogram and kernel estimators are usually regarded as the two main classical data-based nonparametric tools to estimate the underlying density functions for some given data sets. In this paper we will integrate them and define a histogram-kernel error based on the integrated square error between histogram and binned kernel density estimator, and then exploit its asymptotic properties. 3ust as indicated in this paper, the histogram-kernel error only depends on the choice of bin width and the data for the given prior kernel densities. The asymptotic optimal bin width is derived by minimizing the mean histogram-kernel error. By comparing with Scott's optimal bin width formula for a histogram, a new method is proposed to construct the data-based histogram without knowledge of the underlying density function. Monte Carlo study is used to verify the usefulness of our method for different kinds of density functions and sample sizes. Histogram and kernel estimators are usually regarded as the two main classical data-based nonparametric tools to estimate the underlying density functions for some given data sets. In this paper we will integrate them and define a histogram-kernel error based on the integrated square error between histogram and binned kernel density estimator, and then exploit its asymptotic properties. 3ust as indicated in this paper, the histogram-kernel error only depends on the choice of bin width and the data for the given prior kernel densities. The asymptotic optimal bin width is derived by minimizing the mean histogram-kernel error. By comparing with Scott's optimal bin width formula for a histogram, a new method is proposed to construct the data-based histogram without knowledge of the underlying density function. Monte Carlo study is used to verify the usefulness of our method for different kinds of density functions and sample sizes.
出处 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 2012年第3期607-624,共18页 应用数学学报(英文版)
基金 Supported by the National Natural Science Foundation of China (No. 70371018, 70572074)
关键词 HISTOGRAM binned kernel density estimator bin width histogram-kernel error integrated square error Histogram binned kernel density estimator bin width histogram-kernel error integrated square error
  • 相关文献

参考文献21

  • 1Beer, C.F., Swanepoel, J.W.H. Simple and effective number-of-bins circumference selectors for a histogram. Statistics and Computing, 9:27-35 (1999).
  • 2Bowman, A.W. An alternative method of cross-validation for the smoothing of density estimates. Biometrika, 71:353-360 (1984).
  • 3Cencov, N.N. Estimation of an unknown distribution density from observations. Soviet Math., 3:1159-1562 (1962).
  • 4Daly, J.E. The construction of optimal histogram. Commun. Statist. Theory Meth., 17(9): 2921-2931 (1988).
  • 5Devroye, L. The double kernel method in density estimation, Annales de L'Institut Henri Poincare, 25: 533-580 (1989).
  • 6Faraway, J.J., Jhun, M. Bootstrap choice of bandwidth for density estimation. Journal of Statistical Planning and Inference, 85:1119-1122 (1990).
  • 7Freedman, D., Diaconis, P. On the histogram as a density estimation: L2 theory. Zeitschrift fur Wahrschein- lichkeitstheorie und verwandte Gebiete, 57:453-476 (1981).
  • 8He, K., Meeden, G. Selecting the number of bins in a histogram: A decision theoretical approach. Journal of Statistical Planning and Inference, 61:49-59 (1997).
  • 9Parzen, E. Nonparametric statistical data modeling (with discussion). Journal of the American Statistical Association, 74:105-131 (1979).
  • 10Rosenblatt, M. Remarks on some nonparametric estimates of a density function. Annals of Mathematical Statistics, 27:832-837 (1956).

同被引文献27

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部