期刊文献+

基于QPSO-LSSVM的数据库相似重复记录检测算法 被引量:6

Approximate Duplicate Record Detection Algorithm Based on PSO and LSSVM
下载PDF
导出
摘要 针对大规模数据库的相似重复记录的检测问题,提出了一种量子群优化算法(QPSO)与最小二乘支持向量机(LSSVM)相结合的相似重复记录检测方法(QPSO-LSSVM)。首先计算记录字段的相似度值;然后利用QPSO对LSSVM参数进行优化,构建相似重复记录检测模型;最后通过具体数据集进行仿真测试实验。仿真结果表明,QPSO-LSSVM不仅提高了重复记录检测准确率,而且提高了检测效率,是一种有效的相似重复记录检测算法。 Approximately duplicate record detection algorithm was proposed based on quantum swarm algorithm(QPSO) and least squares support vector machine(LSSVM) to solve the large-scale database approximation duplicate record detection problem.Firstly,the record field similarity values are calculated,and then the LSSVM parameters are optimized,by QPSO to construction the approximately duplicate records detection model,finally simulation experiments are carried out on the data set.The simulation results show that QPSO-LSSVM not only improves the accuracy of the duplicate record detection but also improves the detection efficiency,and it is an effective approximate duplicate record,Detection algorithm.
出处 《计算机科学》 CSCD 北大核心 2012年第11期157-159,190,共4页 Computer Science
基金 河南省科学技术厅科技攻关科学项目(112102210199) 河南省科学技术厅基础与前言研究项目(112300410201)资助
关键词 量子粒子群优化算法 最小二乘支持向量机 相似重复记录 检测 Quantum particle swarm optimization Least square support vector machines Approximately duplicate record Detection
  • 相关文献

参考文献12

二级参考文献85

共引文献335

同被引文献47

  • 1刘厚贵,邢晶,霍志刚,安学军.一种支持海量数据备份的可扩展分布式重复数据删除系统[J].计算机研究与发展,2013,50(S2):64-70. 被引量:5
  • 2夏定元,刘书宇,周曼丽1.基于小波和相对矩的形状特征提取与检索方法[J].计算机工程,2004,30(20):146-147. 被引量:1
  • 3葛利.一种基于混合遗传算法学习的过程神经网络[J].哈尔滨工业大学学报,2005,37(7):986-988. 被引量:21
  • 4费园园,孙劲光,陶志勇.基于小波分解和灰度共生矩阵的纹理图像检索[J].现代计算机,2007,13(10):58-59. 被引量:2
  • 5Imagarmid A K, Ipeirotis P G, Verykios V S. Duplicate record detec- tion:a survey [ J ]. IEEE Transactions on Knowledge and Data Engi- neering,2007,19 ( 1 ) : 1 - 16.
  • 6Li Huang, Hai Jin, Pingpeng Yuan, et al. Duplicate records cleansing with length filtering and dynamic weighting [ C ]. Fourth International Conference on Semantics, Knowledge and Grid. Beijing: IEEE Press, 2008:95 - 102.
  • 7Coelho L S. Gaussian quantum behaved particle swarm optimization ap- proaches for constrained engineering design problems [ J ]. Expert Sys- tems with Applications,2010,37 (2) : 1676 - 1683.
  • 8Sun J, Fang W, Xu X J, et al. Quantum-Behaved Particle Swarm Opti- mization: Analysis of the Individual Particle' s Behavior and Parameter Selection [ J ]. Evolutionary Computation,2012,20 ( 3 ) : 349 - 393.
  • 9CLEMENTS A T, AHMAD I, VILAYANNUR M, et al. Decentral- ized deduplication in SAN cluster file systems [ C]// Proceedings of the 2009 USENIX Annual Technical Conference. Berkeley, CA: USENIX Association, 2009:101 - 114.
  • 10ESHGHI K, LILLIBRIDGE M, WILCOCK L, et al. Jumbo Store: providing efficient incremental upload and versioning for a utility rendering service [ C] // Proceedings of the 5th USENIX Conference on File and Storage Technologies. Berkeley, CA: USENIX Associa- tion, 2007:123 - 138.

引证文献6

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部