期刊文献+

简单子抽样多元双样本检验的改进方法

Improved Simple Subsampling Based on the Nearest Neighbor Method
下载PDF
导出
摘要 基于简单子抽样多元双样本检验方法,提出一种改进的检验方法。改进的方法一方面对混合样本进行集成子抽样,既达到平衡样本容量的目的,又尽可能地保留所有样本点的信息;另一方面,在检验统计量的构造中根据样本的非平衡度,采用加权调整的策略,进一步减小样本非平衡度对检验结果的影响。 In this article,we propose an improved simple subsampling based on the nearest neighbor method to solve this problem. The new testing procedure achieves the goal of balancing data as well as to retain the information by using ensemble subsampling scheme. On the other hand,the new method has further increasing testing power by combining the weighting adjustment.
机构地区 河海大学理学院
出处 《江南大学学报(自然科学版)》 CAS 2015年第5期652-658,共7页 Joural of Jiangnan University (Natural Science Edition) 
基金 国家自然科学基金项目(51379064) 江苏省自然科学基金项目(1014-51314411)
关键词 非平衡 双样本检验 KNN算法 子抽样 unbalanced two-sample tests k-nearest neighbor algorithm subsampling
  • 相关文献

参考文献14

  • 1Bickel P J.A distribution free version of the Smirnov two sample test in the p-variate case[J].The Annals of Mathematical Statistics,1969,40(1):1-23.
  • 2Friedman J H,Rafsky L C.Multivariate generalizations of the wald wolfowitz and smimov two-sample tests[J].The Annals of Statistics,1979,7(4):697-717.
  • 3Schilling M F.Multivariate two-sample tests based on nearest neighbors[J].Journal of the American Statistical Association,1986,81(395):799-806.
  • 4Rosenbaum P R.An exact distribution-free test comparing two multivariate distributions based on adjacency[J].Journal of the Royal Statistical Society:Series B;Statistical Methodology,2005,67(4):515-530.
  • 5Aslan B,Zech G.New test for the multivariate two-sample problem based on the concept of minimum energy[J].Journal of Statistical Computation and Simulation,2005,75(2):109-119.
  • 6CHEN H, Friedman J H. New graph-based two-sample tests for multivariate distributions [BE/OL]. 2013-07-15. http://arxiv. org/abs/1307. 629.
  • 7CHEN L,DOU W W,QIAO Z.Ensemble subsampling for imbalanced multivariate two-sample tests[J].Journal of the American Statistical Association,2013,108(504):1308-1323.
  • 8王永吉,杨慧中.基于K近邻的支持向量机多模型建模[J].江南大学学报(自然科学版),2010,9(1):7-10. 被引量:4
  • 9孙晓燕,张化祥,计华.用于不均衡数据集分类的KNN算法[J].计算机工程与应用,2011,47(28):143-145. 被引量:9
  • 10石静,邱立坤,王菲,等.相似词获取的集成方法[C]//孙茂松,陈群秀.中国计算语言学研究前沿进展(2009-2011),北京:清华大学出版社,2011:277-283.

二级参考文献67

共引文献71

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部