基于软件演化历史识别并推荐重构克隆的方法被引量：1

Method for Identifying and Recommending Reconstructed Clones Based on Software Evolution History

下载PDF

导出

摘要现有克隆代码重构研究局限于单一版本的静态分析,忽略了克隆代码的演化过程,这导致在克隆代码重构决策方面缺乏有效的方法。因此文中首先从克隆检测、克隆映射、克隆家系以及软件维护日志管理系统中提取与克隆代码密切相关的演化历史信息;其次识别出需要重构的克隆代码,同时识别出跟踪的克隆代码,然后提取与重构相关的静态特征和演化特征,并构建特征样本数据库;最后对比多种机器学习的方法对,选出效果最佳的分类器推荐重构克隆。在7款软件近170个版本上进行的实验表明,推荐重构克隆代码的准确度达到90%以上,这为软件开发和维护人员提供了更加准确、合理的代码重构建议。 The research on the existing clone code reconstruction is limited to a single version of static analysis while ignoring the evolution process of the cloned code,resulting in a lack of effective methods for reconstructing the cloned code.Therefore,this paper firstly extracted the evolution history information closely related to the clone code from clone detection,clone mapping,clone family and software maintenance log management system.Secondly,the clone code that needs to be reconstructed was identified,and the traced clone code was identified at the same time.Then,static features and evolution features were extracted and reconstructed and a feature sample database was built.Finally,a variety of machine learning methods were used to compare and select the best classifier recommended reconstruction of clones.In this paper,experiments were performed on nearly 170 versions of 7 software.The results show that the readiness for reconstructing cloned code is more than 90%.It provides more accurate and reasonable code reconstruction suggestions for software development and maintenance personnel.

作者折蓉蓉张丽萍 SHE Rong-rong;ZHANG Li-ping(College of Computer and Information Engineering,Inner Mongolia Normal University,Hohhot 010022,China)

机构地区内蒙古师范大学计算机与信息工程学院

出处《计算机科学》 CSCD 北大核心 2019年第8期224-232,共9页 Computer Science

基金国家自然科学基金资助项目(61462071) 内蒙古自然科学基金资助项目(2018MS06009) 内蒙古教育厅资助项目(NJZY17049) 内蒙古师范大学科研基金项目(2016ZRYB003)资助

关键词克隆代码克隆重构克隆跟踪克隆家系特征提取 Code clone Clone refactoring Clone tracking Clone family Feature extraction

分类号 TP311.5 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献7

1折蓉蓉,张丽萍,侯敏,闫盛.基于决策树推荐克隆重构的方法[J].计算机应用,2018,38(7):2037-2043. 被引量：3
2王欢,张丽萍,闫盛,刘东升.克隆代码有害性预测中的特征选择模型[J].计算机应用,2017,37(4):1135-1142. 被引量：2
3张久杰,王春晖,张丽萍,侯敏,刘东升.基于Token编辑距离检测克隆代码[J].计算机应用,2015,35(12):3536-3543. 被引量：13
4葛广帅,刘东升,侯敏.基于LDA和DBSCAN的软件多版本克隆群映射方法[J].计算机应用研究,2017,34(2):481-486. 被引量：2
5张瑞霞,张丽萍,王春晖,侯敏.基于主题建模技术的克隆群映射方法[J].计算机工程与设计,2015,36(6):1524-1529. 被引量：11
6葛广帅,刘东升,张丽萍,侯敏.基于图模型的克隆代码演化痕迹构建及模式识别[J].计算机工程,2017,34(5):47-54. 被引量：3
7刘冬瑞,刘东升,张丽萍,侯敏,王春晖.基于贝叶斯网络预测克隆代码质量[J].计算机科学,2017,44(4):165-168. 被引量：4

二级参考文献51

1Bettenburg N, Shang W, Ibrahim W, et al. An empirical study on inconsistent changes to code clones at release level [C] //Proc of the 16th Working Conference on Reverse Engi- neering. IEEE Press, 2009: 85-94.
2Zibran M F, Roy C K. The road to software clone manage- ment: A survey [R]. Technical Report, The University of Saskatchewan, 2012: 1-66.
3Saha R K, Asduzzaman M, Zibran M F, et al. Evaluating code clone genealogies at release level: An empirical study [C] //Proceedings of the 10th IEEE Working Conference on Source Code Analysis and Manipulation. Washington DC: IEEE Computer Society, 2010: 87-96.
4Bakota T, Ferenc R, Gyimothy T. Clone smells in software evolution [C] //IEEE International Conference on Software Maintenance. Washington DC: IEEE Computer Society, 2007 : 24-33.
5Saha R K, Roy C K, Schneider K A. An automatic framework for extracting and classifying near-miss clone genealogies [C] //27th IEEE International Conference on Software Main- tenance, ZO11 29-302.
6Barbour L, Khomh F, Zou Y. Late propagation in software clones [C] //Proceedings of the 27th IEEE International Con- ference on Software Maintenance. Washington DC: IEEE Computer Society, 2011: 273-282.
7Gode N, Koschke R. Incremental clone detection [C] //Pro- ceedings of the European Conference on Software Maintenance and Reengineering. Washington DC: IEEE Computer Society, 2009 : 219-228.
8Duala-Ekoko E, Robillard M P. Tracking code clones in evol- ving software [C] //Proceedings of the 29th International Conference on Software Engineering. Washington DC: IEEE Computer Society, 2007 .. 158-167.
9Grant S, Cordy J. Estimating the optimal number of latent con- cepts in source code analysis [C] //10th IEEE Working Conference on Source Code Analysis and Manipulation, 2010: 65-74.
10Lukins S, Kraft N, Etzkorn L. Bug localization using latent Diriehlet allocation [J]. Information and Software Technolo- gy, 2010, 52 (9): 972-990.

共引文献25

1许能闯,袁健,高喜龙.含代码的IT社区答案质量评价模型[J].小型微型计算机系统,2019,40(1):158-163. 被引量：1
2张丽萍,张瑞霞,王欢,闫盛.基于贝叶斯网络的克隆代码有害性预测[J].计算机应用,2016,36(1):260-265. 被引量：8
3张久杰,翟晔,王春晖,张丽萍,刘东升.基于版本间克隆映射的演化模式识别及谱系构建[J].计算机应用,2016,36(7):2021-2030. 被引量：4
4陈桌,张丽萍,王欢,张久杰,王春晖.基于改进向量空间模型的克隆群映射方法[J].计算机应用,2016,36(7):2031-2037. 被引量：3
5陈桌,张丽萍,王春晖.基于软件代码演化信息的克隆谱系提取方法[J].计算机应用,2016,36(12):3461-3467.
6王欢,张丽萍,闫盛.克隆代码有害性预测中分类不平衡问题的解决方法[J].计算机应用,2016,36(12):3468-3475.
7葛广帅,刘东升,侯敏.基于LDA和DBSCAN的软件多版本克隆群映射方法[J].计算机应用研究,2017,34(2):481-486. 被引量：2
8王欢,张丽萍,闫盛,刘东升.克隆代码有害性预测中的特征选择模型[J].计算机应用,2017,37(4):1135-1142. 被引量：2
9葛广帅,刘东升,张丽萍,侯敏.基于图模型的克隆代码演化痕迹构建及模式识别[J].计算机工程,2017,34(5):47-54. 被引量：3
10王春晖,张久杰,刘志国,张丽萍,刘东升.基于演化模式特征的克隆代码分类[J].计算机工程与设计,2017,38(8):2121-2126.

同被引文献2

1ZHANG Fanlong,KHOO Siau-Cheng,SU Xiaohong.Machine-Learning Aided Analysis of Clone Evolution[J].Chinese Journal of Electronics,2017,26(6):1132-1138. 被引量：1
2苏小红,张凡龙.面向管理的克隆代码研究综述[J].计算机学报,2018,41(3):628-651. 被引量：8

引证文献1

1欧阳鹏,陆璐,张凡龙,邱少健.基于迁移学习和过采样技术的跨项目克隆代码一致性维护需求预测[J].计算机科学,2020,47(9):10-16.

1侯敏,张丽萍.克隆代码检测技术研究[J].计算机技术与发展,2019,29(8):86-91. 被引量：1
2韩冬楠,边坤,韦贝贝.蒙古族图案元素提取与重构[J].包装工程,2019,40(6):1-7. 被引量：25
3张志浩,杨春花.基于代码克隆检测的抽取方法重构模式识别[J].计算机应用与软件,2019,36(9):12-15. 被引量：1
4郭秀虎.信息技术在初中物理演示实验教学中的应用[J].西部素质教育,2019,5(16):118-119. 被引量：5
5王静.MongoDB日志管理系统探析——以《全国报刊索引》平台为例[J].湖北大学学报（自然科学版）,2019,41(3):318-324.
6于述春,林晶,黄斌.基于软件工程思想的软件测试教学研究与实践[J].信息系统工程,2019,32(7):174-175. 被引量：1
7孔春雷,王泽昊.基于软件无线电的测距系统[J].数字技术与应用,2019,37(7):83-84. 被引量：5
8金旭,王家峰,刘慧军,杜宪峰.基于软件仿真的发动机建模研究[J].汽车实用技术,2019,0(17):56-58. 被引量：1
9杨佳,付才,韩兰胜,鲁宏伟,刘京亮.云环境下基于函数编码的移动应用克隆检测[J].通信学报,2019,40(8):60-71.
10周栋梁.“实验：测定电池的电动势和内阻”的重构建议[J].教学月刊（中学版）（教学参考）,2019,0(7):97-101.

计算机科学

2019年第8期

浏览历史

内容加载中请稍等...

基于软件演化历史识别并推荐重构克隆的方法被引量：1

参考文献7

二级参考文献51

共引文献25

同被引文献2

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于软件演化历史识别并推荐重构克隆的方法 被引量：1

参考文献7

二级参考文献51

共引文献25

同被引文献2

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于软件演化历史识别并推荐重构克隆的方法被引量：1