A Component Retrieval Method Based on Learning to Rank (基于排序学习的构件检索方法的研究)


Abstract: This paper applies learning to rank to component retrieval. First, components are described comprehensively using the facet description method, and features are extracted from the facet-described components with the word2vec model and a weight-setting method. Next, latent semantic analysis and cosine similarity calculation are performed on the component features to obtain a component training data set. Finally, the component training data set and the component data set are used to train the parameters of an improved Plackett-Luce probabilistic ranking model by maximum likelihood estimation, yielding a component ranking model. The component ranking model is then applied to component retrieval to implement a component retrieval method. Experiments verify the effectiveness of the method: its recall, precision, and efficiency are all better than those of traditional component retrieval methods.
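The ranking step summarized in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not specify the improvement made to the Plackett-Luce model, so the code below uses the standard Plackett-Luce likelihood with a linear scoring function, fitted by plain gradient ascent. The function names (`cosine_similarity`, `pl_log_likelihood`, `fit_linear_pl`), the learning rate, and the toy feature values are all assumptions made for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_u * norm_v)


def pl_log_likelihood(scores, ranking):
    """Log-probability of `ranking` (item indices, best first) under the
    standard Plackett-Luce model with real-valued item scores."""
    total = 0.0
    for i in range(len(ranking)):
        rest = [scores[j] for j in ranking[i:]]
        m = max(rest)  # subtract the max for numerical stability
        log_z = m + math.log(sum(math.exp(s - m) for s in rest))
        total += scores[ranking[i]] - log_z
    return total


def fit_linear_pl(features, ranking, lr=0.1, steps=500):
    """Maximum likelihood fit of w for score_i = w . x_i, using gradient
    ascent on the Plackett-Luce log-likelihood (hypothetical settings)."""
    dim = len(features[0])
    w = [0.0] * dim
    for _ in range(steps):
        scores = [sum(wk * xk for wk, xk in zip(w, x)) for x in features]
        grad = [0.0] * dim
        for i in range(len(ranking)):
            rest = ranking[i:]
            m = max(scores[j] for j in rest)
            exps = [math.exp(scores[j] - m) for j in rest]
            z = sum(exps)
            for k in range(dim):
                expected = sum(e * features[j][k] for e, j in zip(exps, rest)) / z
                grad[k] += features[ranking[i]][k] - expected
        w = [wk + lr * gk for wk, gk in zip(w, grad)]
    return w


# Toy data: one feature per component, e.g. a similarity to the query.
features = [[0.9], [0.2], [0.5]]
ranking = [0, 2, 1]  # observed order, most relevant first
w = fit_linear_pl(features, ranking)
scores = [sum(wk * xk for wk, xk in zip(w, x)) for x in features]
predicted = sorted(range(len(scores)), key=lambda i: -scores[i])
```

In this toy setting the fitted weight is positive, so components with higher similarity receive higher scores and `predicted` reproduces the training order. With perfectly consistent training rankings the unregularized weights keep growing as training continues, so a real system would likely add regularization or early stopping.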

Authors: CHEN Hua-ye, WANG Hai-tao, JIANG Ying, CHEN Xing (Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China)

Affiliation: Faculty of Information Engineering and Automation, Kunming University of Science and Technology

Source: Computer Engineering & Science (《计算机工程与科学》), indexed in CSCD and the Peking University Core Journals list, 2021, No. 6, pp. 1006-1013 (8 pages)

Funding: National Natural Science Foundation of China (61462049).

Keywords: learning to rank; component retrieval; latent semantic analysis; maximum likelihood estimation

Classification number: TP311.5 [Automation and Computer Technology: Computer Software and Theory]


