
Tag Weighting Model for XML Retrieval

Cited by: 1
Abstract: In XML (Extensible Markup Language) retrieval, taking into account where a keyword occurs in a document helps improve retrieval performance. A common approach assigns different weights to different tags in the document and weights each keyword according to the tag of the node that contains it. In most existing work, however, tag weights are set manually, which is labor-intensive and subjective. This paper proposes measuring the weight of an XML tag by its topic generalization strength, so that tag weights are calculated automatically. Experimental results show that the method effectively improves the quality of XML retrieval.
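The general idea of tag-dependent term weighting mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's topic generalization method: the tag names, the weight values in `TAG_WEIGHTS`, and the function `weighted_term_frequencies` are all hypothetical, and the weights are hand-picked, which is exactly the manual setting the paper aims to replace.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical, hand-picked tag weights (illustrative values only).
TAG_WEIGHTS = {"title": 3.0, "abstract": 2.0, "p": 1.0}
DEFAULT_WEIGHT = 1.0


def weighted_term_frequencies(xml_text):
    """Sum, for each term, the weight of the tag of every node containing it."""
    root = ET.fromstring(xml_text)
    scores = defaultdict(float)
    for node in root.iter():
        weight = TAG_WEIGHTS.get(node.tag, DEFAULT_WEIGHT)
        # Only the text directly inside the node is counted here;
        # tail text after child elements is ignored to keep the sketch short.
        for term in (node.text or "").lower().split():
            scores[term] += weight
    return dict(scores)


doc = (
    "<article><title>XML retrieval</title>"
    "<p>tag weights in XML retrieval</p></article>"
)
scores = weighted_term_frequencies(doc)
```

In this sketch a term occurring in a `title` node contributes more to the score than the same term in a `p` node; an automatic model would compute the entries of `TAG_WEIGHTS` from the collection instead of fixing them by hand.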
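Source: Journal of Frontiers of Computer Science and Technology (《计算机科学与探索》), CSCD, 2010, No. 8, pp. 723-730 (8 pages)
Funding: National Natural Science Foundation of China (Nos. 60803105, 60763001); Science and Technology Project of the Jiangxi Provincial Department of Education (No. GJJ08508)
Keywords: XML retrieval; tag weight; topic generalization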













