Study on Approximately Duplicate Record Detection Method Based on Genetic Neural Network (基于遗传神经网络的相似重复记录检测方法研究) Cited by: 1
Abstract An extensible approximately duplicate record detection system is designed and implemented. The system comprises a data preprocessing module, a clustering module, a field matching module, and a record matching module, and it supports custom extension of the clustering and field matching algorithms. The working principle and implementation mechanism of each of the four modules are described. Experiments compare the method with several well-known approximately duplicate record detection algorithms, and the results show that the system improves detection precision.
Affiliation Air Force Radar Academy (空军雷达学院)
Source Ship Electronic Engineering (《舰船电子工程》), 2011, No. 2, pp. 168-170, 176 (4 pages)
Keywords genetic neural network, approximately duplicate record detection system, clustering algorithm, field matching algorithm
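
The four-module pipeline named in the abstract (preprocessing, clustering, field matching, record matching) might be sketched as follows. This is only an illustrative sketch under stated assumptions: the abstract does not describe the genetic neural network itself, so record matching here is a plain weighted average of field similarities, field matching uses the `difflib.SequenceMatcher` ratio from the Python standard library, and clustering is a simple grouping by a caller-supplied key. All function names and parameters are hypothetical and do not come from the paper.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, List, Tuple

Record = Dict[str, str]


def preprocess(record: Record) -> Record:
    """Preprocessing module: lowercase each field and collapse whitespace."""
    return {k: " ".join(v.lower().split()) for k, v in record.items()}


def cluster_by_key(records: List[Record],
                   key: Callable[[Record], str]) -> List[List[int]]:
    """Clustering module (stand-in): group record indices by a blocking key."""
    groups: Dict[str, List[int]] = {}
    for i, rec in enumerate(records):
        groups.setdefault(key(rec), []).append(i)
    return list(groups.values())


def field_similarity(a: str, b: str) -> float:
    """Field matching module (stand-in): similarity ratio in [0, 1]."""
    return SequenceMatcher(None, a, b).ratio()


def record_similarity(a: Record, b: Record, weights: Dict[str, float],
                      field_sim: Callable[[str, str], float]) -> float:
    """Record matching module (stand-in): weighted mean of field similarities.

    The paper's genetic neural network would take the place of this function.
    """
    total = sum(weights.values())
    score = sum(w * field_sim(a.get(f, ""), b.get(f, ""))
                for f, w in weights.items())
    return score / total


def detect_duplicates(records: List[Record],
                      key: Callable[[Record], str],
                      weights: Dict[str, float],
                      threshold: float = 0.85,
                      field_sim: Callable[[str, str], float] = field_similarity,
                      ) -> List[Tuple[int, int]]:
    """Run the four stages and return index pairs judged to be duplicates."""
    cleaned = [preprocess(r) for r in records]
    pairs: List[Tuple[int, int]] = []
    for group in cluster_by_key(cleaned, key):
        # Compare records only within the same cluster.
        for x in range(len(group)):
            for y in range(x + 1, len(group)):
                i, j = group[x], group[y]
                sim = record_similarity(cleaned[i], cleaned[j],
                                        weights, field_sim)
                if sim >= threshold:
                    pairs.append((i, j))
    return pairs
```

Passing `key` and `field_sim` as parameters mirrors the customizable clustering and field matching algorithms mentioned in the abstract; the threshold of 0.85 is an arbitrary illustrative value.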
References (4)

  • 1 Hernandez M, Stolfo S. The merge/purge problem for large databases[J]. ACM SIGMOD Record, 1995, 24(2): 127-138.
  • 2 Monge A, Elkan C. An efficient domain-independent algorithm for detecting approximately duplicate database records[C]//Proceedings of the ACM-SIGMOD Workshop on Research Issues on Knowledge Discovery and Data Mining, Tucson, AZ, 1997.
  • 3 Hernandez M A, Stolfo S J. Real-world data is dirty: data cleansing and the merge/purge problem[J]. Data Mining and Knowledge Discovery, 1998, 2(1): 9-37.
  • 4 Monge A, Elkan C. An efficient domain independent algorithm for detecting approximately duplicate database records[J]//Proceedings of the SIGMOD Workshop on Data Mining and Knowledge Discovery, Tucson, Arizona, 1997: 211-217.

Co-cited documents (9)

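  • 1 Tim Bass. Multisensor Data Fusion for Next Generation Distributed Intrusion Detection Systems[C]. In Proceedings of 1999 IRIS National Symposium on Sensor and Data Fusion. America: The Johns Hopkins University, 1999: 1-6.
  • 2 Tim Bass. Intrusion Detection Systems and Multisensor Data Fusion: Creating Cyberspace Situational Awareness[J]. Communications of the ACM, 2000, 43(4): 99-105.
  • 3 Jason Shifflet. A Technique Independent Fusion Model for Network Intrusion Detection[C]. Proceedings of the Midstates Conference on Undergraduate Research in Computer Science and Mathematics, America: Denison University, 2003(1): 13-19.
  • 4 Stephen Lau. The Spinning Cube of Potential Doom[C]. Communications of the ACM, 2004, 47(6): 25-26.
  • 5 Carnegie Mellon's SEI. System for Internet Level Knowledge (SILK)[EB/OL]. http://silktools.sourceforge.net, 2011-05.
  • 6 Research Center of Information Security and Countermeasure Technology, Beijing Institute of Technology. Technical White Paper on the Network Security Situation Assessment System[EB/OL]. http://~.thinkor.com/product/dowmload/网络安全态势评估系统白皮书2.doc, 2011-05.
  • 7 Hao Shuyong, Xuan Lei, Zhang Zhuo. Research on a cloud-based rule mining algorithm for network security situation prediction[J]. Computer & Digital Engineering (计算机与数字工程), 2010, 38(8): 141-144. Cited by: 2
  • 8 Jia Yan, Wang Xiaowei, Han Weihong, Li Aiping, Cheng Wencong. YHSSAS: a security situational awareness system for large-scale networks[J]. Computer Science (计算机科学), 2011, 38(2): 4-8. Cited by: 36
  • 9 Gao Fei, Wang Huiqiang. Research on a service security situational awareness model based on multi-source data features[J]. Journal of Wuhan University (Natural Science Edition) (武汉大学学报(理学版)), 2011, 57(2): 165-169. Cited by: 3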

Citing documents (1)

Second-level citing documents (3)
