期刊文献+

基于边界样本的训练样本选择方法 被引量:15

A Method for the Selection of Training Samples Based on Boundary Samples
下载PDF
导出
摘要 以入侵检测系统中的分类器设计为例,研究分类器训练样本选择问题.提出了一种大规模数据集的训练样本选择方法.首先通过聚类将训练数据划分为不同的子集,缩小搜索范围;然后根据聚类内离散度和样本的覆盖区域选择样本,保留每个聚类的边界样本,删除内部样本.仿真结果证实,由于保留了典型样本,减少了训练样本数量,从而保证了分类器的性能且训练效率较高. Taking the example of designing classifier in intrusion detection system, the selection of training samples for classifier is studied. A new method is proposed for sample selection in large data set. First, it will reduce the size of selection problem via clustering, select samples according to the with-in cluster scatter value and coverage area of a sample. And it will retain boundary samples and discard most of the interior ones in each cluster. Experiment result shows that as reserving typical samples and reducing training samples, the generalization performance and training efficient of the classifier are guaranteed.
作者 张莉 郭军
出处 《北京邮电大学学报》 EI CAS CSCD 北大核心 2006年第4期77-80,共4页 Journal of Beijing University of Posts and Telecommunications
基金 国家自然科学基金项目(60475007)
关键词 样本选择 离散度 覆盖区域 边界样本 sample selection scatter coverage area boundary samples
  • 相关文献

参考文献9

  • 1郭代飞,杨义先,胡正名.基于大规模网络的自适应入侵响应模型研究[J].北京邮电大学学报,2004,27(1):79-83. 被引量:11
  • 2Wilson D R, Martinez T R. Instance pruning techniques [C]// Proceedings of the 14th International Conference. San Francisco: Morgan Kaufmann Publishers Inc, 1997:404-411.
  • 3Astrahan M M. Speech analysis by clustering, or the hyper-phoneme method [R]. Calif: Stanford Univ, 1970.
  • 4Mitra P, Pal S K. Density-based multiscale data condensation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(6): 734-747.
  • 5Ng W W Y, Yeung D S, Cloete I. Input sample selection for rbf neural network classification problems using sensitivity measure [C]// IEEE International Conference on Systems Man and Cybernetics. Washington: [s. n.], 2003: 2593-2598.
  • 6Tambouratzis T. Counter-clustering for training pattern selection [J]. The Computer Journal, 2000, 43 (3) :177-190.
  • 7Lyhyaoui A, Ynez M M, Mora I. Sample selection via clustering to construct support vector-like classifiers [J]. IEEE Transactions on Neural Networks, 1999, 10 (6) :1474-1480.
  • 8Brighton H, Mellish C. Advances in instance selection for instance-based learning algorithms [J]. Data Mining and Knowledge Discovery, 2002, 6(2): 153-172.
  • 9Luo Dingsheng, Chen Ke. Refine decision boundaries of a statistical ensemble by active learning [C] // International Joint Conference on Neural Networks. Portland: [s.n.], 2003: 1523-1528.

二级参考文献1

共引文献10

同被引文献82

引证文献15

二级引证文献41

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部