
Research on a Holistic Schema Matching Method for the Deep Web

A Method of Deepweb Schema Matching Based on Data Mining
Abstract: Building on an in-depth analysis of existing Deep Web schema matching techniques, this paper proposes a new approach, holistic schema matching. The approach uses data mining to discover n-ary complex matchings across multiple schemas in a single pass. Experiments and actual operation indicate that the approach produces fairly good results.
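Source: Acta Scientiarum Naturalium Universitatis Nankaiensis (Journal of Nankai University, Natural Science Edition), 2012, No. 5, pp. 24-31 (8 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.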
Funding: Tianjin Key Project (11JC2DJC28100); National Science and Technology Support Program project (2012BAF12B00)
Keywords: Deep Web; complex matching; data mining; holistic schema matching








