A Ranking Algorithm of Keyword Search on Probabilistic XML Data (概率XML关键字检索排序算法)

Cited by: 1

Abstract: This paper addresses the problem of ranking the results of content-related keyword search over collections of probabilistic XML documents, and proposes a new ranking model based on the characteristics of such documents. Unlike existing ranking algorithms, which depend only on the probabilities of the retrieval results, the proposed algorithm also takes into account how well a node discriminates between documents, how well a node describes its document, and the structural characteristics of the XML document itself. A ranking model for retrieval results that satisfies these features is designed, and a new inverted index structure is proposed to support it. The new algorithm completes keyword search quickly and returns the most relevant information to the user. Experiments on simulated datasets show that the proposed method is effective.
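The abstract names the ingredients of the ranking model but gives no formulas, so the following is only an illustrative sketch and not the authors' algorithm. It assumes a TF-IDF-style stand-in for "describing" (term frequency) and "discriminating" (inverse document frequency), the node's existence probability as a weight, and node depth (read from a Dewey-style label) as a crude structural factor. The toy corpus, the function names `build_index` and `score`, and the weighting are all hypothetical.

```python
import math
from collections import defaultdict

# Hypothetical toy corpus: doc id -> list of (dewey_label, existence_probability, text).
docs = {
    "d1": [("1.1", 0.9, "xml keyword search"), ("1.2", 0.5, "ranking")],
    "d2": [("1.1", 0.8, "xml xml storage")],
}

def build_index(docs):
    """Inverted index: term -> list of (doc_id, dewey_label, probability, term_frequency)."""
    index = defaultdict(list)
    for doc_id, nodes in docs.items():
        for dewey, prob, text in nodes:
            words = text.split()
            for term in sorted(set(words)):
                index[term].append((doc_id, dewey, prob, words.count(term)))
    return index

def score(index, n_docs, term):
    """Rank postings for one term by probability * tf * idf / depth (assumed weights)."""
    postings = index.get(term, [])
    df = len({doc_id for doc_id, _, _, _ in postings})   # documents containing the term
    idf = math.log(1 + n_docs / df) if df else 0.0       # "discriminating" factor
    ranked = []
    for doc_id, dewey, prob, tf in postings:
        depth = dewey.count(".") + 1                      # structural factor from the label
        ranked.append((prob * tf * idf / depth, doc_id, dewey))
    return sorted(ranked, reverse=True)

index = build_index(docs)
results = score(index, len(docs), "xml")
```

Here `results` lists the matching nodes for the keyword `xml`, highest score first. A real system would combine several keywords through SLCA computation, which this sketch leaves out.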

Authors: ZHAO Yue (赵越), YUAN Ye (袁野), WANG Guoren (王国仁)

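Affiliations: School of Computer Science and Engineering, Northeastern University; School of Information Engineering, Shenyang University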

Source: Journal of Northeastern University (Natural Science) (《东北大学学报（自然科学版）》), 2016, No. 8, pp. 1095-1099 (5 pages). Indexed in: EI, CAS, CSCD, Peking University Core Journals.

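Funding: National Natural Science Foundation of China (6100024, 61332006, U1401256); National Basic Research Program of China (2011CB302200-G); Fundamental Research Funds for the Central Universities (N130504006)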

Keywords: keyword search; probabilistic XML data; SLCA (smallest lowest common ancestor); ranking

Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]

