期刊文献+

结合匹配度和语义相似度的Deep Web查询接口模式匹配 被引量:1

Deep Web query interface schema matching based on matching degree and semantic similarity
下载PDF
导出
摘要 查询接口模式匹配是Deep Web信息集成中的关键部分,双重相关性挖掘方法(DCM)能有效利用关联挖掘方法解决复杂接口模式匹配问题。针对DCM方法在匹配效率、匹配准确性方面的不足,提出了一种基于匹配度和语义相似度的新模式匹配方法。该方法首先使用矩阵存储属性间的关联关系,然后采用匹配度计算属性间的相关度,最后利用语义相似度计算候选匹配的相似性。通过在美国伊利诺斯大学的BAMM数据集上进行实验,所提方法与DCM及其改进方法比较有更高的匹配效率和准确性,表明该方法能更好地处理接口之间模式匹配问题。 Query interface schema matching is a key step in Deep Web data integration.Dual Correlated Mining(DCM) is able to make full use of association mining method to solve the problems of complex interface schema matching.There are some problems about DCM,such as inefficiency and inaccuracy in matching.Therefore,a new method based on matching degree and semantic similarity was presented in this paper to solve the problems.Firstly,the method used correlation matrix to save the association relationship among attributes;and then,matching degree was applied to calculate the degree of correlation between attributes;at last,semantic similarity was used to ensure the accuracy of final results.The experimental results on BAMM data sets of University of Illinois show that the proposed method has higher precision and efficiency than DCM and improved DCM,and indicate that the method can deal with the query interface schema matching problems very well.
作者 冯永 张洋
出处 《计算机应用》 CSCD 北大核心 2012年第6期1688-1691,共4页 journal of Computer Applications
基金 国家自然科学基金资助项目(61103114) 重庆市高等教育教学改革研究重点项目(112023) "211工程"三期建设项目(S-10218) 中央高校基本科研业务基金资助项目(CDJXS11181164)
关键词 DEEP WEB 模式匹配 匹配度 语义相似度 Deep Web schema matching matching degree semantic similarity
  • 相关文献

参考文献14

  • 1The Deep Web: surfacing hidden value[ EB/OL]. [ 2011- 10- 20]. http://brightplanet, com.
  • 2JAYANT M, JEFFERY S R, COHEN S. Web-scale data integra- tion: you can only afford to pay as you go[ EB/OL]. [2011-10- 22]. http://www, eidrdb, org/cidr2007/papers/eidr07p40, pdf.
  • 3DONG YONGQUAN, LI QINGZHONG, DING YANHUI, et al. ET- TA-IM: A deep Web query interface matching approach based on evidence theory and task assignment[ J]. Expert Systems with Appli- cations, 2011,38(8) : 10218 - 10228.
  • 4姜芳艽,孟小峰.Deep Web数据集成中查询处理的研究与进展[J].计算机科学与探索,2009,3(2):113-129. 被引量:4
  • 5HE B, CHANG K C. Statistical schema matching across Web query interfaces[ C] // Proceedings of the 22nd ACM SIGMOD Internation- al Conference on Management of Data. New York: ACM, 2003:217 - 228.
  • 6HE BIN, CHANG K C C, HAN JIAWEI. Discovering complex matching across Web query interfaces: a correlation mining approach [ C]// Proceedings of the lOth International Conference on Knowl- edge Discovery and Data Mining. New York: ACM, 2004:148 - 157.
  • 7WU W, YU C, DOAN A, et al. An interactive clustering-based ap- proach to integrating source query interface on the deep Web[ C]// Proceedings of ACM SIGMOD International Conference on Manage- merit of Data. New York: ACM, 2004:95 - 106.
  • 8MADHAVAN J, BERNSTEIN P A, DOAN A, et al. Corpus-based schema matching[ C]// Proceedings of the 21 st International Confer- ence on Data Engineering. Washington, DC: IEEE Computer Socie- ty, 2005:57 -68.
  • 9伊卫国,卫金茂,王名扬.挖掘有效的关联规则[J].计算机工程与科学,2005,27(7):91-94. 被引量:9
  • 10FELLBAUM C. WordNet: an electronic lexical database[ M]. Cam- bridge, Massachusetts: MIT Press, 1998.

二级参考文献37

  • 1He Bin, Chang K C C. Statistical Schema Matching Across Web Query Interfaces[C] //Proc. of the ACM SIGMOD International Conf. on Management of Data. San Diego, California, USA:[s. n.] , 2003.
  • 2Madhavan J, Bernstein P A, Doan A, et al. Corpus-based Schema Matching[C] //Proc. of the 21st International Conf. on Data Engineering. Tokyo, Japan:[s. n.] , 2005.
  • 3He Bin, Chang K C C. Automatic Complex Schema Matching Across Web Query Interfaces: A Correlation Mining Approach[J]. ACM Transactions on Database Systems, 2006, 31(1): 1-45.
  • 4Wu Wensheng, Yu C, Doan A, et al. An Interactive Clustering-based Approach to Integrating Source Query Interfaces on the Deep Web[C] //Proc. of ACM SIGMOD International Conf. on Management of Data. Paris, France:[s. n.] , 2004.
  • 5JIANG Fangjiao JIA Linlin MENG Xiaofeng.Query Translation on the Fly in Deep Web Integration[J].Wuhan University Journal of Natural Sciences,2007,12(5):819-824. 被引量:2
  • 6R Agrawal, T Imielinski, A Swami.Mining Association Rules Between Sets of Items in Large Databases [A].Proc 1993 ACM SIGMOD Conf on Management of Data[C].1993.207-216.
  • 7http://www.ics.uci.edu/~mlearn/MLSummary.html,2003-05.
  • 8Halevy A Y, Rajaraman A, Ordille J J. Data integration: The teenage years//Proceedings of the 32nd International Conference on Very Large Data Bases. Seoul, 2006:9-16
  • 9Elmagarmid A K, Ipeirotis P G, Verykios V S. Duplicate record detection: A survey. IEEE Transactions on Knowledge and Data Engineering, 2007, 19(1): 1-16
  • 10He H, Meng W, Yu C T, Wu Z. WISE-integrator: An automatic integrator of Web search interfaces for E-commerce// Proceedings of the 29th International Conference on Very Large Data Bases. Berlin, 2003:357-368

共引文献26

同被引文献13

  • 1赵朋朋,崔志明,高岭,仲华.关于中国Deep Web的规模、分布和结构[J].小型微型计算机系统,2007,28(10):1799-1802. 被引量:13
  • 2Sherman C,Price G.The invisible Web:uncovering information sources search engines can't see[M].Medford,New Jersey,USA:Information Today,Inc,2001.
  • 3Chang Kevin Chen-Chuan,He Bin,Zhang Z.Structured databases on the Web:observations and implications[J].SIGMOD Record,2004,33(3).
  • 4Bergman M.The Deep Web:surfacing hidden value[J].The Journal of Electronic Publishing,2001,7(1):8912-8914.
  • 5Fetterly D,Manasse M,Najork M.A large-scale study of the evolution of Web pages[J].Software-Practice and Experience,2003,1(1).
  • 6Chang Kevin Chen-Chuan,He Bing,Zhang Z.Toward large scale integration:building a meta querier over databases on the Web[C]//CIDR,Asilomar,Galifornia,2005.
  • 7He H,Meng W Y,Lu Y Y,et al.Towards deeper understanding of the search interfaces of the Deep Web[J].Word Wide Web Journal,2007,10(2):133-155.
  • 8He Bin,Chang K C C.Statistical schema matching across Web query interfaces[C]//SIGMOD Conference,San Diego,California,USA,2003:217-228.
  • 9Wu W,Yu C,Doan A,et al.An interactive clustering-based approach to integrating source query interfaces on the Deep Web[C]//SIGMOD Conference,Paris,2004.New York:ACM Press,2004:95-106.
  • 10Hacene M R,Napoli A.Ontology learning from text using relational concept analysis[C]//Proceedings of International MCETECH Conference on e-Technologies,2008:154-163.

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部