期刊文献+

String核负实例语法特征提取算法

Grammatical Feature Extraction Algorithm for String Kernel False Instance
下载PDF
导出
摘要 通过String核方法把语法数据库中的负实例转化成核矩阵,采用Kmeans聚类算法对核矩阵进行聚类,将原始负实例数据库分成多个容量较小的特征数据表,使大规模O(n3)核矩阵转换为n/s×O(s3)(s<<n)矩阵,以减少运算量。分析语法检查精度随Kmeans聚类参数的变化规律。实验结果表明,该算法在不降低语法检查精度的前提下提高了语法检查速度。 This paper translates false instance in grammatical database to kernel matrix through String kernel method, uses Kmeans clustering method to cluster the kernel matrix and separate the original false instance database into many characteristic tables with small capacitance. It transforms large scale O(n^3) kernel matrix into n/s×O(s^3)(s〈〈n) matrix to decrease calculation amount, and analyzes the rule of the grammatical check accuracy with the change of Kmeans clustering parameters. Experimental results show that this algorithm can enhance the running speed without decreasing the accuracy of grammatical check.
出处 《计算机工程》 CAS CSCD 北大核心 2009年第23期12-14,共3页 Computer Engineering
基金 国家自然科学基金资助项目(10471156 10531040)
关键词 Kmeans方法 聚类 String核 负实例 特征提取 Kmeans method clustering String kernel false instance feature extraction
  • 相关文献

参考文献5

  • 1Golding A R. A Window-based Approach to Context-sensitive Spelling Correction[J]. Machine Learning, 1999, 34(1-3): 107-130.
  • 2Watson I, Marir F. Case-based Reasoning: A Review[J]. Knowledge Engineering Review, 1994, 9(4): 355-381.
  • 3Lodhi H, Saunders C, Shawe-Taylor J, et al. Text Classification Using String Kernels[J]. The Journal of Machine Learning Research, 2002, 2: 419-444.
  • 4Macqueen J. Some Methods for Classification and Analysis of Multivariate Observations[C]//Proc. of the 5th Berkeley Symp. on Math. Statist. Berkeley, CA, USA: University of California Berkeley Press, 1967:281-297.
  • 5Muller K R. An Introduction to Kernel-based Learning Algorithms[J]. IEEE Transactions on Neural Networks, 2001, 12(2): 181-201.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部