String核负实例语法特征提取算法

Grammatical Feature Extraction Algorithm for String Kernel False Instance

下载PDF

导出

摘要通过String核方法把语法数据库中的负实例转化成核矩阵,采用Kmeans聚类算法对核矩阵进行聚类,将原始负实例数据库分成多个容量较小的特征数据表,使大规模O(n3)核矩阵转换为n/s×O(s3)(s<<n)矩阵,以减少运算量。分析语法检查精度随Kmeans聚类参数的变化规律。实验结果表明,该算法在不降低语法检查精度的前提下提高了语法检查速度。 This paper translates false instance in grammatical database to kernel matrix through String kernel method, uses Kmeans clustering method to cluster the kernel matrix and separate the original false instance database into many characteristic tables with small capacitance. It transforms large scale O（n^3） kernel matrix into n/s×O（s^3）（s〈〈n） matrix to decrease calculation amount, and analyzes the rule of the grammatical check accuracy with the change of Kmeans clustering parameters. Experimental results show that this algorithm can enhance the running speed without decreasing the accuracy of grammatical check.

作者吕威林文昶李磊

机构地区北京师范大学珠海分校信息技术学院中山大学软件研究所

出处《计算机工程》 CAS CSCD 北大核心 2009年第23期12-14,共3页 Computer Engineering

基金国家自然科学基金资助项目(10471156 10531040)

关键词 Kmeans方法聚类 String核负实例特征提取 Kmeans method clustering String kernel false instance feature extraction

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献5

1Golding A R. A Window-based Approach to Context-sensitive Spelling Correction[J]. Machine Learning, 1999, 34(1-3): 107-130.
2Watson I, Marir F. Case-based Reasoning: A Review[J]. Knowledge Engineering Review, 1994, 9(4): 355-381.
3Lodhi H, Saunders C, Shawe-Taylor J, et al. Text Classification Using String Kernels[J]. The Journal of Machine Learning Research, 2002, 2: 419-444.
4Macqueen J. Some Methods for Classification and Analysis of Multivariate Observations[C]//Proc. of the 5th Berkeley Symp. on Math. Statist. Berkeley, CA, USA: University of California Berkeley Press, 1967:281-297.
5Muller K R. An Introduction to Kernel-based Learning Algorithms[J]. IEEE Transactions on Neural Networks, 2001, 12(2): 181-201.

1王玲.仿真中影响焊枪可达性精度的探究[J].汽车制造业,2014(18):66-67.
2张翔,吝睿涛.一种分布式中文微博热点话题的发现方法[J].无线互联科技,2014,11(12):168-169.
3张友海,李锋刚.基于MapReduce的KMeans聚类算法的并行化实现[J].九江学院学报（自然科学版）,2017,32(1):73-75. 被引量：2
4郭强,邵科技,朱鸿宇.基于MapReduce模型的并行KMeans聚类算法[J].高性能计算技术,2010,0(5):33-37.
5郭明,丁华福.基于SOM网和K-means的聚类算法[J].计算机与数字工程,2008,36(9):22-24. 被引量：6
6魏建香,刘怀,苏新宁.基于遗传算法的文档聚类算法的设计与仿真(英文)[J].南京大学学报（自然科学版）,2009,45(3):432-438. 被引量：4
7裘晨曦,徐雅斌,李艳平,李卓.一种基于无监督学习的社交网络流量快速识别方法[J].数学的实践与认识,2014,44(3):100-107. 被引量：1
8袁小艳.ABC_Kmeans聚类算法的MapReduce并行化研究[J].计算机测量与控制,2016,24(1):252-254. 被引量：5
9陈英,何中市,黄敏.一种优化的K-means聚类中心算法研究[J].制造业自动化,2012,34(8):19-22. 被引量：5
10王晓明,熊九龙,王志虎,祝夏雨,张玘.一种自动的图像分割方法[J].微型机与应用,2013,32(23):29-33.

计算机工程

2009年第23期

浏览历史

内容加载中请稍等...

String核负实例语法特征提取算法

参考文献5

相关作者

相关机构

相关主题

浏览历史