信息检索的概率模式匹配

Information retrieval for probabilistic schema matching

下载PDF

导出

摘要为解决不同的计算机平台、数据存储格式、文档模型以及结构文档模式的异构性,以及联邦数字图书馆和信息检索等应用环境中将一种模式下的数据结构转换成另一种模式下数据结构的需求。提出一个基于概率的模式匹配映射框架,称作PMap,使用概率论的方法,给出候选预测权值的概率学解释,从而选择一个最优的匹配方式。模式匹配就是寻找异构模式之间一致性,将主要应用在数据交换和联邦数字图书馆中的分布式信息检索领域中,使得异构文档获得统一的检索格式。 Distributed information systems tend to be highly heterogeneous, integrate different computer platforms, data storage formats, document models and schemas which structure the documents and the latter aspectrequires to transform data structured under one schema into data structured under a different schema. For these reason, a probabilistic framework is introduced, called PMap. Our approach gives a probabilistic interpretation of the prediction weights of the candidates, selects the rule set with highest matching probability. Schema matching is the problem of finding correspondences （mapping rules, e.g. logical formulae） between heterogeneous schemas e.g. in the data exchange domain, or for distributed IR in federated digital libraries. The union formulae is formed by IR heterogeneous.

作者孙岩岩陈飞丛喜慧

机构地区中国环境管理干部学院燕山大学

出处《计算机工程与设计》 CSCD 北大核心 2008年第17期4626-4628,共3页 Computer Engineering and Design

基金秦皇岛市2006年科学技术研究与发展指导计划基金项目(20060286) 中国环境管理干部学院院内科研基金项目(S2006020) 燕山大学科技发展基金项目(YDJJ200591)

关键词模式匹配概率论 PMap 数据交换概率论 schema matching probability theory PMap data exchange probability theory

分类号 TP302.1 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献8

1Lenzerini M.Data integration:a theoretical perspective[C]. Proceedings of the 21st ACM SIGMOD-SIGACT-SIGART Symposiumon Principles of Database Systems,2002:233-246.
2Fagin R, Kolaitis P G, Miller R, et al. Data exchange: semantics and query answering[C].Proceedings of the International Conference on Database Theory,2003:207-224.
3严武军,马小燕.高校数字图书馆元数据检索系统的设计与实现[J].计算机工程与设计,2006,27(1):162-164. 被引量：15
4Norbert Fuhr.Probabilistic datalog: implementing logical information retrieval for advanced applications [J]. Journal of the American Society of Information Science,2000,51 (2):95-110.
5Madhavan J, Bernstein P, Chen K, et al.Corpus-based schema matching[C].Proceedings of the 21st International Conference on Data Engineering. IEEE Computer Society,2005:57-68.
6Rahm E, Bemstein P A. A survey of approaches to automatic schema matching[J]. The VLDB Journal,2001,10(4):334-350.
7Bilke A, Neumann F. Schema matching using duplicates [C]. Proceedings of the 21 st International Conference on Data Engineering. IEEE Computer Society,2005:69-80.
8Sebastiani F. Machine learning in automated text categorization [J].ACM Computing Surveys,2002,34(1): 1-47.

二级参考文献5

1马小燕,严武军.XML技术在数字图书馆的应用研究[J].太原师范学院学报（自然科学版）,2003,2(4):20-22. 被引量：4
2陈会仓.数字图书馆的检索技术[J].现代电子技术,2001,24(6):71-74. 被引量：8
3吕健强.ASP中文全文检索在SQL Server 2000中的实现[J].计算机与农业,2003(1):25-25. 被引量：2
4陈艳梅.基于元数据的数字图书馆信息资源组织[J].大学图书情报学刊,2003,21(1):40-42. 被引量：11
5王松林.DC-Lib——我国数字图书馆元数据的首选[J].中国图书馆学报,2004,30(1):55-59. 被引量：10

共引文献14

1王彩虹.元数据在高校数字图书馆信息资源中的应用[J].湖北师范学院学报（哲学社会科学版）,2007,27(6):137-139. 被引量：4
2孙素云.基于Web服务统一检索系统的设计[J].现代计算机,2007,13(4):79-81. 被引量：7
3严武军.信息工程监理在高校数字图书馆中的应用[J].现代计算机,2007,13(8):67-68.
4孙素云.基于元数据集成检索系统的设计与实现[J].广东轻工职业技术学院学报,2007,6(2):10-13. 被引量：1
5李卫峰,胡孔法.基于XML WEB SERVICE的数字图书馆统一检索技术研究[J].情报杂志,2008,27(9):27-28. 被引量：4
6魏清凤,贺立源,黄魏,余秋华.网络农业信息资源元数据研究及其著录管理系统开发[J].现代情报,2009,29(2):52-56. 被引量：5
7龙海威,张波,朱昊.元数据在公共图书馆的应用[J].科学咨询,2009(7):45-45.
8蔡昭权,卢庆武,郑宗晖,罗伟.基于元数据的快速开发平台设计与实现[J].计算机工程,2009,35(9):60-62. 被引量：12
9严武军.高校数字图书馆的网络系统安全研究[J].电脑开发与应用,2009,22(11):67-68. 被引量：1
10严武军.基于Jena规则推理数字图书馆信息检索系统研究[J].电脑开发与应用,2010,23(2):40-42. 被引量：7

1王昱.基于ExtJS的JSON数据交换格式研究[J].现代计算机,2013,19(2):61-62. 被引量：3
2陈蕾,杨鹏.HMIPv6域间切换方法的性能分析[J].重庆科技学院学报（自然科学版）,2008,10(3):78-80. 被引量：1
3蔡镇河,张旭,栾江霞.CPU+GPU异构模式下并行计算效率研究[J].计算机与现代化,2012(5):185-188. 被引量：5
4金广,李宏.一种异构数据模式集成的方法及实现[J].绍兴文理学院学报,2008,28(7):43-47.
5金广,李宏.一种异构数据模式集成的方法及实现[J].湖南科技学院学报,2008,29(4):88-90.
6孙学军.信息系统中异构数据库集成关键技术研究[J].河南机电高等专科学校学报,2006,14(1):24-26. 被引量：2
7王甲民,杨子翔,沈均毅.用UML设计XML文档模式[J].计算机工程与应用,2002,38(22):131-133. 被引量：2
8袁占亭,张秋余,李威.数据抽取及语义分析在Web数据挖掘中的应用[J].计算机工程与设计,2005,26(6):1425-1427. 被引量：6
9王颖,林亮亮.基于语义的Web数据交换[J].信息技术,2005,29(9):76-79.
10宋国平.基于异构模式的云计算关键技术研究[J].计算机光盘软件与应用,2013,16(21):25-26.

计算机工程与设计

2008年第17期

浏览历史

内容加载中请稍等...

信息检索的概率模式匹配

参考文献8

二级参考文献5

共引文献14

相关作者

相关机构

相关主题

浏览历史