
Automatic Semantic Annotation for Metadata in Databases (数据库元数据的自动语义标注) Cited by: 3
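Abstract: Semantic heterogeneity is a key problem in integrating information from heterogeneous databases. To give the tables and fields of a relational database semantic information, automatically annotating database metadata as semantic metadata has become an active research topic. This paper proposes an automatic semantic annotation method for database metadata, based on semantic similarity computed over concept names and concept structure. The method first extracts the implicit semantic information from the metadata of a relational database and builds a domain ontology from it. It then annotates the extracted metadata automatically by computing the semantic similarity between the metadata and the ontology entities. The proposed similarity algorithm takes both the names and the structure of concepts into account and includes optimizations to improve it. Experiments show that the method achieves high annotation accuracy and is an effective approach to semantic annotation.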
Source: Computer Science (《计算机科学》), indexed in CSCD and the Peking University Core Journals list, 2012, Issue S3, pp. 159-162 (4 pages)
Keywords: semantic annotation; metadata; ontology; database
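
The abstract describes annotating metadata by combining name similarity and structural similarity between a table and the ontology concepts. The following is a minimal sketch of that idea, not the paper's algorithm: the abstract does not give the formulas, so the use of normalized edit distance for names, the field-matching measure for structure, the weight `w_name`, the `threshold`, and all function names are assumptions made for illustration.

```python
from typing import Dict, List, Optional


def levenshtein(a: str, b: str) -> int:
    """Classic edit distance (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def name_similarity(a: str, b: str) -> float:
    """Assumed name measure: 1 - normalized edit distance, case-insensitive."""
    a, b = a.lower(), b.lower()
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


def structure_similarity(fields_a: List[str], fields_b: List[str]) -> float:
    """Assumed structure measure: average best name match of each field."""
    if not fields_a or not fields_b:
        return 0.0
    total = sum(max(name_similarity(f, g) for g in fields_b) for f in fields_a)
    return total / len(fields_a)


def combined_similarity(name_a: str, fields_a: List[str],
                        name_b: str, fields_b: List[str],
                        w_name: float = 0.6) -> float:
    """Weighted sum of name and structure similarity (weight is hypothetical)."""
    return (w_name * name_similarity(name_a, name_b)
            + (1.0 - w_name) * structure_similarity(fields_a, fields_b))


def annotate(table: str, fields: List[str],
             ontology: Dict[str, List[str]],
             threshold: float = 0.5) -> Optional[str]:
    """Return the best-matching ontology concept, or None below the threshold."""
    if not ontology:
        return None
    best = max(ontology,
               key=lambda c: combined_similarity(table, fields, c, ontology[c]))
    score = combined_similarity(table, fields, best, ontology[best])
    return best if score >= threshold else None
```

As a usage example, `annotate("student", ["id", "name"], {"Student": ["id", "name"], "Course": ["id", "title"]})` selects the `Student` concept, because both the table name and its fields match that concept exactly. A real system would likely replace the flat field lists with the ontology's property and hierarchy structure.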









