期刊文献+

代码与文档间关联关系的提取方法研究和改进 被引量:3

Analysis and Improvement on Retrieval Methods for Traceability Links between Source Code and Documentation
下载PDF
导出
摘要 在潜在语义模型的基础上融入了软件文档和程序代码的特点,提出了基于类继承关系的代码聚类、代码特征项分类加权、引入相似度词典以及基于文档类型的分类搜索这四种改进策略.实验结果表明,四种策略可以在保持查全率不变的情况下提高查准率15%左右.表明在提取代码与文档间可跟踪性链时,考虑它们的固有特点,将有助于提高检索系统的查全率和查准率. Software documentation is usually expressed in natural languages and free text, in which it captures large useful information. Establishing traceability links between documentation and source code can be helpful in Software Engineering Management. Currently, the recovery of traceability links is mostly based on information retrieval techniques, e. g., probabilisfic model, vector space model and Latent Semantic Indexing(LSI). But previous work only treats documentation and source code as plain text files without considering the features with respect to Software Engineering. Four enhancing strategies are proposed to improve the traditional LSI method based on the features of software documentation and source code,namely,source code clustering,identifiers classifying, similarity thesaurus and hierarchical structure enhancement. Experimental results show that the four enhancement strategies can increase the precision by about 15%. So, the special characteristics of documentation and source code should be considered carefully during the recovering traceability links between them.
出处 《电子学报》 EI CAS CSCD 北大核心 2009年第B04期22-30,共9页 Acta Electronica Sinica
基金 国家863项目(No.2006AA01Z176) 国家自然科学基金(No.90718018)
关键词 信息检索 可跟踪性链 程序理解 逆向工程 information retrieval(IR) traceability recovery program comprehension reverse engineering
  • 相关文献

参考文献12

  • 1Marcus A, Malefic J I, Sergeyev A. Recovery of traceability links between software documentation and source code[ J ]. International Journal of Software Engineering and Knowledge Engineering(IJSEKE), 2005,15 (5) : 811 - 836.
  • 2Antoniol G,Canfora G,Casazza G,et al. Recovering traceability links between code and documentation [ J ]. IEEE. Transactions on Software Engineering, 2002,28(10) :970- 983.
  • 3Marcus A, Malefic J I. Recovering documentation to source code traceability links using latent semantic indexing[ A]. Proceedings 25th International Conference on Software Engineering ( ICSE' 03)[C]. Portland, OR, USA, 2003. 125 - 135.
  • 4Cleland-huang J, Settimi R, Chuan D, et al. Utilizing supporting evidence to improve dynamic requirements traceability [A]. Procedings 13th IEEE International Conference on Requirements Engineering ( RE' 05 ) [C], Paris, France, 2005. 135 - 144.
  • 5钱剑飞,陈华,陈奇,俞瑞钊.一种代码与中文文档关联信息的自动提取方法[J].浙江大学学报(工学版),2004,38(11):1417-1421. 被引量:2
  • 6Salton G, Buckley C. Retrieval performance by relevance feed- back[J]. Journal of the American Society for Information Science, 1990,41 (4) :288 - 297.
  • 7De Lucia A, Ofiveto R, Sgueglia P. Incremental approach and user feedbacks: A silver bullet for traceability recovery [ A ]. Proceedings 22nd IEEE International Conference on Software Maintenance(ICSM' 06) [C]. Philadelphia, USA, 2006. 299 - 309.
  • 8P C H, Raghavan P. Latent semantic indexing: A probabilistic analysis[ J]. Journal of Computer and System Sciences, 2000 (61) :217 - 235.
  • 9Hayes J H, Dekhtyar A, Osborne J. Improving requirements tracing via information retrieval[ A ]. Proceedings 11th IEEE, International on Requirements Engineering Conference [C]. Monterey, CA, USA, 2003. 138 - 147.
  • 10王映辉,王立福,张世琨,王琼芳.一种软件需求变化追踪方法[J].电子学报,2006,34(8):1428-1432. 被引量:9

二级参考文献15

  • 1王映辉,王立福.软件体系结构演化模型[J].电子学报,2005,33(8):1381-1386. 被引量:23
  • 2BIGGERSTAFF T. Design recovery for maintenance and reuse [J]. IEEE Computer, 1989,22(7): 36 - 49.
  • 3BIGGERSTAFF T, MITBANDER B. The concept assignment problem in program understanding [A]. Proceeding of the 15th International Conference on Software Engineering [C]. Baltimore: [s.n.], 1993, 5:482 -498.
  • 4ANTONIOL G, CANFORA G, CASAZZA G, et al. Tracing object-oriented code into functional requirements [A]. 8th International Workshop on Program Comprehension [C]. Limerick: [s. n.], 2000: 227-230.
  • 5ANTONIOL G,CANFORA G,CASAZZA G,et al. Recovering traceability links between code and documentation [J]. IEEE Transactions on Software Engineering,2002, 28(10): 970 - 983.
  • 6ANTONIOL G, CAPRILE B, POTRICH A, et al.Design-code traceability for object oriented systems [J].The Annals of Software Engineering,2000, 9:35 -58.
  • 7HARMAM D. Ranking algorithms [J]. Data Structures and Algorithms, 1992,18(2): 363 - 392.
  • 8范大茵,陈永华.概率论与数理统计[M].杭州:浙江大学出版社,1998:26-27.
  • 9COVER T M, THMAS J A. Elements of information theory [M]. New York: Wiley, 1992.
  • 10MAAREK Y S,BERRY D M,KAISER G E. An information retrieval approach for automatically constructing software libraries [J]. IEEE Transactions on Software Engineering, 1991,17(8): 800 - 814.

共引文献9

同被引文献41

  • 1钱剑飞,何钦铭,陈华,俞瑞钊.一种基于模糊推理的细匹配方法[J].计算机工程,2007,33(8):163-164. 被引量:1
  • 2ALIAS C, BARTHOU D. Algorithm recognition based on demand- driven data-flow analysis[C]//WCRE'03: Proceedings of the 10th Working Conference on Reverse Engineering. Washington, DC: IEEE Computer Society, 2003: 296.
  • 3ALLEN R, KENNEDY K. Optimizing compilers for modem archi- tectures[ M]. Klecakova: Elsevier Science, 2001.
  • 4METZGER R, WEN ZHAOFANG. Automatic algorithm recognition and replacement [ M]. Cambridge: MIT Press, 2000.
  • 5TAHERKHANI A, MALMI L, KORHONEN A. Algorithm recogni- tion by static analysis and its application in students' submissions as- sessment[ C]// Proceedings of the 8th Koli Calling International Conference on Computing Education Research. New York: ACM, 2008 : 88 - 91.
  • 6MURRAY W R. Automatic program debugging for intelligent tuto- ring systems[J]. Computational Intelligence, 1987, 3(1) : 1 - 16.
  • 7JOHNSON W L, SOLOWAY E. Proust: Knowledge-based program understanding[ J]. IEEE Transactions on Software Engineering, 1985, 11(5) : 267 -275.
  • 8KIM S, KIM J H. Algorithm recognition for programming tutoring based on flow graph parsing[ J]. Applied Intelligence, 1996, 6(2) : 153 - 164.
  • 9TAHERKHANI A. Recognizing sorting algorithm with the C4.5 de- cision tree classifier[ C]// IEEE 18th International Conference on Program Comprehension. Washington, DC: IEEE Computer Society, 2010:72 - 75.
  • 10RICH C. A formal representation for plans in the programmer's ap- prentice[ M]. San Francisco: Morgan Kaufmann Publishers Inc., 1986.

引证文献3

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部