期刊文献+

一种面向软件特征定位问题的语义相似度集成方法 被引量:5

A Semantic Similarity Integration Method for Software Feature Location Problem
下载PDF
导出
摘要 特征是软件系统中被需求所定义的可执行功能实体.识别软件特征与源代码间映射关系的过程被称作特征定位.基于信息检索的特征定位方法由于高易用性和低开销等优点,被广泛应于软件维护、代码搜索等领域.所有基于信息检索的特征定位方法均建立在语义相似度计算基础之上,当前语义相似度计算存在2个主要问题:第一,源代码数据中大量噪声信息对相似度计算的干扰;第二,不同索引方法局限性导致的相似度计算结果失准.针对这2个问题,提出了一种面向软件特征定位问题的语义相似度集成方法.该方法在预处理过程引入词性过滤,有效过滤源代码中噪声数据,提升相似性计算的准确度.然后,以源代码数据自身结构特性为依据,集成不同索引方法进行相似度计算.在公开数据集上进行了实验,与现有方法相比,词性过滤和相似度集成在平均排序倒数性能上分别带来了30.88%和10.28%的提升,验证了所提方法的有效性. Feature is an executable function entity that’s defined in software system.The process of identifying the mapping relationship between the software features and source code is called feature location.Information retrieval feature location method is widely used in software maintenance,code search and other fields because of its high usability and low overhead.All the information retrieval feature location methods are based on semantic similarity calculation.However,there are two main problems:1)There is a lot of noise data in the source code corpus.The noise data will interfere with the result of similarity calculation.2)Different index methods’limitation will lead to the similarity calculation results being inaccurate.To solve these problems,a semantic similarity integration method for software feature location problem is proposed.This method introduces the Part-of-Speech filtering in the preprocessing process,effectively filtering the source code noise data,and improving the accuracy of similarity calculation.Then,different index methods are integrated to calculate similarities based on the source code’s structured characteristics.Experiments are performed on the open data benchmarks.Compared with the existing methods,the POS filtering improves by an average of 30.88%on the mean reciprocal rank performance,while similarity integration improves an average of 10.28%.The experimental result verifies the effectiveness of the proposed methods.
作者 何云 李彤 王炜 李响 兰微 He Yun;Li Tong;Wang Wei;Li Xiang;Lan Wei(College of Software,Yunnan University,Kunming 650091;Key Laboratory for Software Engineering of Yunnan Province (Yunnan University),Kunming 650091)
出处 《计算机研究与发展》 EI CSCD 北大核心 2019年第2期394-409,共16页 Journal of Computer Research and Development
基金 国家自然科学基金项目(61462092 61379032 61662085) 云南省自然科学基金重点项目(2015FA014) 云南省数据驱动的软件工程创新团队项目(2017HC012) 云南大学研究生科研创新基金项目(YDY17094)~~
关键词 特征定位 信息检索 语义相似度 词性过滤 索引方法 集成 feature location information retrieval semantic similarity POS filtering index method integration
  • 相关文献

参考文献4

二级参考文献42

  • 1王映辉,张世琨,刘瑜,王立福.基于可达矩阵的软件体系结构演化波及效应分析[J].软件学报,2004,15(8):1107-1115. 被引量:47
  • 2徐宝文,聂长海,史亮,陈火旺.一种基于组合测试的软件故障调试方法[J].计算机学报,2006,29(1):132-138. 被引量:38
  • 3李洁,高新波,焦李成.基于特征加权的模糊聚类新算法[J].电子学报,2006,34(1):89-92. 被引量:114
  • 4王丽娟,关守义,王晓龙,王熙照.基于属性权重的Fuzzy C Mean算法[J].计算机学报,2006,29(10):1797-1803. 被引量:45
  • 5Poshyvanyk D, Gudh6neuc Y G, Marcus A, et al. Feature location using probabilistic ranking of methods based on execution scenarios and infomlation retrieval[J]. IEEE Trans- actions on Software Engineering, 2007, 33(6): 420-432.
  • 6Dit B, Revelle M, Gethers M, et al. Feature location in source code: a taxonomy and survey[J]. Journal of Software: Evo- lution and Process, 2013, 25(1): 53-95.
  • 7Li Tong. An approach to modelling software evolution pro- cesses[M]. Berlin: Springer, 2008.
  • 8Seacord R C, Plakosh D, Lewis G A. Modernizing legacy systems: software technologies, engineering processes, and business practices[M]. [S.1.]: Addison-Wesley Professional, 2003.
  • 9Wilde N, Gomez J A, Gust T, et al. Locating user functionality in old code[C]//Proceedings of the 1992 International Con- ference on Software Maintenance (ICSM '92), Orlando, USA, 1992. Washington, DC, USA: IEEE Computer Society, 1992: 200-205.
  • 10Wong W E, Wei Tingting, Qi Yu, et al. A crosstab-based statis- tical method for effective fault localization[C]//Proceedings of the 1st International Conference on Software Testing, Verification, and Validation (ICST '08), Lillehammer, Nor- way, Apr 9-11, 2008. Washington, DC, USA: IEEE Computer Society, 2008: 42-51.

共引文献33

同被引文献57

引证文献5

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部