期刊文献+

软件仓库挖掘领域:贡献者和研究热点 被引量:4

Mining Software Repositories:Contributors and Hot Topics
下载PDF
导出
摘要 随着时间的推移,软件不断地更新和演化,软件仓库中累积了海量的数据,如何有效地收集、组织、利用软件工程中涌现的软件大数据是一个至关重要的问题.软件仓库挖掘(mining software repositories,MSR)通过挖掘软件仓库中繁杂多变的数据中蕴含的知识来提高软件的质量和生产效率.虽然一些研究工作详细阐述了MSR的背景、历史和前景,但现有的研究工作并未系统地呈现MSR领域中最有影响力的作者、机构、国家以及最受欢迎的研究主题和主题变迁等领域知识.因此,结合已有的经典的文献分析框架和算法来分析MSR相关文献,并呈现一些MSR基本领域知识.为了实现MSR文献分析,建立了一个包含3个组件的MSR文献分析框架(MSR publication analysis framework,MSR-PAF),这3个组件分别被用来创建数据集、执行基础文献分析、实施合作模式分析.基础文献分析结果表明:最高产的作者、机构、国家?地区分别是Ahmed E.Hassan,University of Victoria和美国,最有影响力作者是Ahmed E.Hassan,最频繁的关键词是software maintenance.合作模式分析的结果显示Abram Hindle是MSR领域最活跃的作者,open source project和software maintenance是最流行的研究主题. Software updates and evolves continuously over time,software repositories accumulatemassive data.H o w to effectively collect,organize,and m a k e use of these data has become a keyproblem in software engineering.Mining Software Repositories(M S R)aim to mine useful knowledgecontained in complex and diversified data to improve the quality and productivity of software.Although some studies have elaborately summarized the background,history,and prospects aboutM S R,existing studies do not present systematically the most influential author,institution,andcountry as well as the major research topics and their transitions over time.Therefore,this studycombines the existing classical publication analysis frameworks and algorithms to analyze therelationships a m o n g publications related to M S R,and presents some important domain knowledge forresearchers in detail.T o effectively tackle this task,w e construct a framework n a m e d M S RPublication Analysis F r a m e w o r k(M S R-P A F).M S R-P A F consists of three components which can beused to create a dataset for the study,conduct a bibliography analysis,and implement a collaborationpattern analysis?respectively.T h e results of the bibliography analysis s h o w that the most productiveauthor,institution,and country are A h m e d E.H a s s a n,University of Victoria,and U S A,respectively.T h e most frequent keyword is software maintenance and the most influential author isA b r a m Hindle.In addition,the results of the collaboration pattern analysis s h o w that A b r a m Hindleis the most active author,and open source project and software maintenance are the most popularresearch topics.
作者 江贺 陈信 张静宣 韩雪娇 徐秀娟 Jiang H e;Che n X i n;Zhang Jingxuan;H a n Xuejiao;X u Xiujuan(School of Software,Dalian University of Technology,Dalian,Liaoning 116024)
出处 《计算机研究与发展》 EI CSCD 北大核心 2016年第12期2768-2782,共15页 Journal of Computer Research and Development
基金 国家自然科学基金项目(61370144) 教育部新世纪优秀人才支持计划基金项目(NCET-13-0073)~~
关键词 文献分析 合作模式分析 数据挖掘 软件仓库挖掘 大数据 publication analysis collaboration pattern analysis data mining mining software repositories big data
  • 相关文献

参考文献1

二级参考文献24

  • 1Xie T,Pei J,Hassan A E.Mining software engineering data[C]∥Proceedings of the 29th International Conference on Software Engineering(ICSE’2007).2007:172-173.
  • 2Hassan A E,Xie T.Software intelligence:the future of miningsoftware engineering data[C]∥Proceedings of the FSE/SDP workshop on Future of Software Engineering Research(FoSER’2010).2010:161-166.
  • 3Xie T,Thummalapenta S,Lo D,et al.Data mining for software engineering [J].Computer,2009,42(8):55-62.
  • 4Srinivasa K G,Venugopal K R,Patnaik L M.Feature extraction using fuzzy c-means clustering for data mining systems[J].International Journal of Computer Science and Network Security,2006,6(3A):230-236.
  • 5Sun C,Lo D,Khoo S C,et al.Towards more accurate retrieval of duplicate bug reports[C]∥Proceedings of 2011 26th IEEE/ACM International Conference on Automated Software Engineering(ASE’11).2011:253-262.
  • 6Anvik J,Hiew L,Murphy G C.Who should fix this bug? [C]∥Proceedings of the 28th International Conference on Software Engineering(ICSE’06).2006:361-370.
  • 7Jeong G,Kim S,Zimmermann T.Improving bug triage with bug tossing graphs[C]∥Proceedings of the 7th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on The Foundations of Software Engineering(FSE’09).2009:111-120.
  • 8Xuan J,Jiang H,Ren Z,et al.Developer prioritization in bug repositories[C]∥Proceedings of the 34th International Confe-rence on Software Engineering(ICSE’12).2012:25-35.
  • 9Mani S,Catherine R,Sinha V S,et al.Ausum:approach for unsupervised bug report summarization[C]∥Proceedings of the ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering(FSE’12).2012:11-21.
  • 10Lotufo R,Malik Z,Czarnecki K.Modelling the ‘hurried’ bug report reading process to summarize bug reports[C]∥Proceedings of the 28th IEEE International Conference on Software Maintenance(ICSM’12).2012:430-439.

共引文献3

同被引文献9

引证文献4

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部