期刊文献+

非参数局部多项式回归估计的最优子抽样算法 被引量:2

Optimal Subsampling Algorithm for Nonparametric Local Polynomial Regression Estimation
原文传递
导出
摘要 随着科学技术的发展,虽然人们提高了收集和处理数据的能力,但仍存在一些大数据集超出了现有计算机的计算能力.目前,抽取一部分样本来替代全样本进行建模计算是减轻计算负担的一种方法.大数据背景下线性模型的子抽样方法已经得到了相对成熟的研究,在减轻计算量方面获得了很大的优势.文章将线性模型下的子抽样方法推广到非参数回归模型,并推导出了基于子样本的加权最小二乘参数估计对全样本加权最小二乘参数估计的收敛速度,以及子样本参数估计的条件渐近正态性.通过最小化渐近方差的准则,提出了非参数局部多项式回归模型下的OPT和PL两种抽样方案,最后在均方误差、计算成本和拟合效果等方面进行数值模拟,比较了OPT子抽样和PL子抽样相对于均匀子抽样和杠杆子抽样的差别,其结果表明于OPT准则和PL准则的子抽样方法在提高估计精确性和减少计算负担方面具有很大优势. In this paper,we extend the subsampling method under the linear model to the nonparametric regression model and propose two subsampling methods for the nonparametric local polynomial regression model.First,we derive the convergence rate of subsampling based weighted least squares parameter estimation to full sample weighted least squares parameter estimation,and the asymptotic normality of the subsample parameter estimation are derived.Then,we use the criterion of minimizing the asymptotic variance,and two subsampling methods of OPT and PL under nonparametric local polynomial regression model are proposed.Finally,numerical simulation of OPT subsampling and PL subsampling,uniform subsampling and Basic Leveraging subsampling are carried out respectively,in terms of mean square error,fitting effect and computational cost.The results show that the subsampling method based on OPT criterion and PL criterion has great advantages in improving estimation accuracy and reducing calculation burden.
作者 牛晓阳 邹家辉 NIU Xiaoyang;ZOU Jiahui(Zhongkai University of Agriculture and Engineering,Guangzhou 510225;Capital University of Economics and Business,Beijing 100070)
出处 《系统科学与数学》 CSCD 北大核心 2022年第1期72-84,共13页 Journal of Systems Science and Mathematical Sciences
基金 首都经济贸易大学北京市属高校基本科研业务费专项资金(XJ2021004601)资助课题。
关键词 局部多项式回归估计 子抽样 加权最小二乘 Local polynomial regression estimation subsampling weighted least squares
  • 相关文献

参考文献5

二级参考文献22

  • 1Breidt, F. J., Opsomer, J. D. Local Polynomial Regression Estimators in Survey Sampling[J].The Annals of Statistics,2000,(2).
  • 2Sarndal E. C., Swensson B., Wretman J. Model Assisted Survey Sampling[M].New York: Springer,1992.
  • 3W.G.科克伦.抽样技术[M].北京:中国统计出版社,1985.
  • 4张小明.应急科技:大数据时代的新进展[N].光明日报,2013-10-14.
  • 5刘建平.辅助信息在抽样调查中的应用模型与方法[M].北京:中国统计出版社,2007.
  • 6Cochran W G著,张尧庭,吴辉译.抽样技术[M].北京:中国统计出版社,1985.
  • 7Deville J C, S~rndal C E. Calibration estimators in survey sampling [J]. Journal of the American Statistical Association, 1992, 87: 376-382.
  • 8Alain Th~berge. Extensions of calibration estimators in survey sampling [J]. Journal of the American Statistical Association, 1999, 94:446 635.
  • 9Royall R M. On finite population sampling theory under certain linear regression models [J]. Biometrika 1970, 57: 37~387.
  • 10Hansen M H, Madow W G, Tepping B J. An evaluation of model-dependent and probability-sampling inferences in sample surveys [J]. Journal of the American Statistical Association, 1983, 78(384): 776- 793.

共引文献363

同被引文献34

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部