一种基于命名实体识别的需求跟踪方法被引量：1

Recovering traceability links using named entity recognition

下载PDF

导出

摘要针对基于文本的需求跟踪方法严重依赖文本质量的问题,提出了一种利用命名实体识别技术标注制品文档关键词的需求跟踪方法。该方法通过代码实体上下文构建命名实体识别模型,解决了抽象语法树和正则表达式无法解析非源代码形式的软件制品问题。利用命名实体识别模型标志出软件制品中的代码实体之后,该方法将软件制品转换为文档集合并进行语义聚类,最后再通过映射算法创建制品间的需求跟踪关系。实验结果表明,与基于所有词项和基于高权重词项的需求跟踪方法相比,该方法能够有效提高需求跟踪结果的质量。 Aiming at the problem that requirement traceability approaches based on textual information were rely heavily on the quality of the text, this paper proposed a traceability approach utilized named entity recognition technology to identify key words in software artefacts. Firstly, the proposed method constructed a named entity recognition model through the context of code entity, which solved the issue that abstract syntax tree and the regular expression was not able to parse non-source form software artefacts. After that, the proposed method transformed software artefacts to document set, and then carried out a se- mantic clustering process to cluster documents. Finally, the proposed method created trace links between software artefacts using the mapping algorithm. The experimental results show that comparing with those traceability approaches based on the all terms and high weight terms, this method is able to effectively improve the quality of requirement tracing results.

作者王金水薛醒思唐郑熠

机构地区福建工程学院信息科学与工程学院

出处《计算机应用研究》 CSCD 北大核心 2016年第1期132-135,146,共5页 Application Research of Computers

基金国家自然科学基金资助项目(61402108) 福建省中青年教师教育科研项目(JA15348 JA13227 JB12146) 福建省科技厅高校项目(JK2012033) 福建工程学院科研启动基金资助项目(GY-Z13113 GY-Z14068)

关键词需求跟踪命名实体识别语义聚类自然语言处理权重计算 requirement traceability named entity recognition semantic clustenng natural language process term weigh-ting

分类号 TP311.5 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献25

1Gotel O C,Finkelstein A C.An analysis of the requirements traceability problem[C] //Proc of the 1st International Conference on Requirements Engineering.[S.l.] :IEEE Press,1994:94-101.
2Ali N,Gueneuc Y,Antoniol G.Trustrace:mining software repositories to improve the accuracy of requirement traceability links[J].IEEE Trans on Software Engineering,2013,39(5):725-741.
3Pohl K.Process-centered requirements engineering[M].[S.l.] :Wiley,1996.
4Aurum A,Wohlin C.Engineering and managing software requirements[M].[S.l.] :Springer,2005.
5Maia M D A,Lafetá R F.On the impact of trace-based feature location in the performance of software maintainers[J].Journal of Systems and Software,2013,86(4):1023-1037.
6Bavota G,De Lucia A,Oliveto R,et al.The role of artefact corpus in lsi-based traceability recovery[C] //Proc of International Workshop on Traceability in Emerging Forms of Software Engineering.[S.l.] :IEEE Press,2013:83-89.
7Peng Xin,Xing Zhenchang,Tan Xi,et al.Improving feature location using structural similarity and iterative graph mapping[J].Journal of Systems and Software,2013,86(3):664-676.
8Diaz D,Bavota G,Marcus A,et al.Using code ownership to improve IR-based traceability link recovery[C] //Proc of the 21st IEEE International Conference on Program Comprehension.San Francisco,CA:IEEE Press,2013:123-132.
9Grant S,Cordy J R,Skillicorn D B.Using heuristics to estimate an appropriate number of latent topics in source code analysis[J].Science of Computer Programming,2013,78(9):1663-1678.
10Panichella A,Dit B,Oliveto R,et al.How to effectively use topic models for software engineering tasks? An approach based on genetic algorithms[C] //Proc of the 35th International Conference on Software Engineering.Piscataway,NJ:IEEE Press,2013:522-531.

二级参考文献26

1季姮,罗振声.基于统计和规则的中文姓名自动辨识[J].语言文字应用,2001(1):14-18. 被引量：13
2Gotel O C Z, Finkelstein C W. An analysis of the requirements traceability problem [C] //Proc of the 1st IEEE Int Conf on Requirements Engineering. Los Alamitos, CA: IEEE Computer Society, 1994:94-101.
3Ferrari A, Gnesi S, Tolomei G. A clustering based approach for discovering flaws in requirements specifications [Q]//Proc of the 27th Annual ACM Symp on Applied Computing. New York: ACM, 2012:104:3-1050.
4Hayes J H, Dekhtyar A, Sundaram S K. Advancing candidate link generation for requirements tracing: The study of methods [J]. IEEE Trans on Software Engineering, 2006, 32(1) : 4-19.
5Aurum A, Wohlin C. Software and Systems Traceability [M]. Berlin: Springer, 2012.
6Cleand-Huang J, Settimi R, Duan C, et al. Utilizing supporting evidence to improve dynamic requirements traceability [C] //Proc of the 13th IEEE Int Conf on Requirements Engineering. Los Alamitos, CA: IEEE Computer Society, 2005:135-144.
7Aurum A, Wohlin C. Engineering and Managing Software Requirements [M]. Berlin: Springer, 2005.
8Pohl K. Process Centered Requirements Engineering [M]. New York John Wiley Sons, 1997.
9Chen Xiaofan, Grundy J. Improving automated documentation to code traceability by combining retrieval techniques [C] //Proc of the 26th IEEE/ACM Int Conf on Automated Software Engineering. Los Alamitos, CA IEEE Computer Society, 2011 223-232.
10Wang Xiaobo, Lai Guanhui, Liu Chao. Recovering relationships between documentation and source code based on the characteristics of software engineering [J]. Electronic Notes in Theoretical Computer Science, 2009, 243 : 121-137.

共引文献40

1彭骁男,周兰江,张建安,周枫.融合多特征的老挝语人名地名命名实体识别[J].中国水运（下半月）,2020,20(3):74-77. 被引量：1
2夏赟,李志蜀.基于统计的中文机构名自动识别[J].四川大学学报（自然科学版）,2009,46(3):613-617. 被引量：1
3刘智文.利用系统整合提高中文分词精度的方法研究[J].现代计算机,2009,15(10):7-10.
4韩普,姜杰.HMM在自然语言处理领域中的应用研究[J].计算机技术与发展,2010,20(2):245-248. 被引量：16
5唐旭日,陈小荷,许超,李斌.基于篇章的中文地名识别研究[J].中文信息学报,2010,24(2):24-32. 被引量：18
6唐旭日,陈小荷,张雪英.中文文本的地名解析方法研究[J].武汉大学学报（信息科学版）,2010,35(8):930-935. 被引量：41
7冯鲸华,古丽拉.阿东别克,玛依来.哈帕尔.基于N-gram语言模型的哈萨克文机构名识别[J].计算机工程与应用,2010,46(31):135-138. 被引量：2
8王昌厚.基于条件随机场的中文命名体识别[J].福建电脑,2012,28(2):89-89. 被引量：2
9胡万亭,杨燕,尹红风,贾真,刘利.一种基于词频统计的组织机构名识别方法[J].计算机应用研究,2013,30(7):2014-2016. 被引量：15
10曾镇,吕学强,李卓.搜索日志中中文人名的自动识别[J].现代图书情报技术,2014(12):71-77. 被引量：1

同被引文献10

1李引,李娟,李明树.动态需求跟踪方法及跟踪精度问题研究[J].软件学报,2009,20(2):177-192. 被引量：14
2王金水,翁伟,彭鑫.一种基于句法分析的跟踪关系恢复方法[J].计算机研究与发展,2015,52(3):729-737. 被引量：5
3郑培真,苑春春,刘超,吴际,杨海燕,胡宁.面向软件安全性需求分析过程的追踪模型[J].计算机科学,2017,44(4):30-34. 被引量：2
4胡成海,彭蓉,王帮超.基于信息检索的需求跟踪方法综述[J].计算机应用与软件,2017,34(10):20-28. 被引量：5
5王飞,黄志球,杨志斌,阚双龙,沈国华,陈光颖.一种安全攸关嵌入式系统需求追踪方法[J].计算机学报,2018,41(3):652-669. 被引量：7
6唐晨,李勇华,饶梦妮,胡钢俊.动态需求跟踪中多义关键词的语义判断方法[J].计算机应用,2019,39(5):1299-1304. 被引量：3
7邓刘梦,沈国华,黄志球,王飞,葛晓瑜.扩展SysML支持需求追踪模型的自动生成[J].计算机科学与探索,2019,13(6):950-960. 被引量：3
8李潇,魏长江.多视点元模型间需求追踪性方法[J].计算机系统应用,2019,28(9):41-49. 被引量：1
9Tian-bao DU,Guo-hua SHEN,Zhi-qiu HUANG,Yao-shen YU,De-xiang WU.Automatic traceability link recovery via active learning[J].Frontiers of Information Technology & Electronic Engineering,2020,21(8):1217-1225. 被引量：3
10杜天保,沈国华,黄志球,王飞,吴德香.通过代码模式改进基于IR的需求和代码之间追踪生成方法[J].小型微型计算机系统,2019,40(5):1107-1114. 被引量：1

引证文献1

1陶传奇,张萌,郭虹静,黄志球.面向不同软件制品的需求追踪方法研究综述[J].计算机学报,2022,45(11):2393-2419. 被引量：1

二级引证文献1

1管博伦,董伟,张立平,杨前进,汪焱.再生稻溯源追踪平台研发[J].农业大数据学报,2023,5(1):55-67.

1司马刚.使用Visual Studio团队开发版进行项目管理[J].程序员,2006(10):124-125.
2严彩梅.Web用户模式[J].扬州大学学报（自然科学版）,2002,5(3):53-56. 被引量：3
3姜芳,李国和,岳翔.基于语义的文档关键词提取方法[J].计算机应用研究,2015,32(1):142-145. 被引量：10
4王金水,翁伟,彭鑫.一种基于句法分析的跟踪关系恢复方法[J].计算机研究与发展,2015,52(3):729-737. 被引量：5
5王灿辉,张敏,马少平,黄宇.基于相邻词的中文关键词自动抽取[J].广西师范大学学报（自然科学版）,2007,25(2):161-164. 被引量：10
6王燕.基于相邻词的中文关键词自动抽取研究[J].科技致富向导,2012(26):84-84.
7黄国森,王燕兴.需求跟踪技术研究[J].计算机工程与科学,2006,28(z2):181-182. 被引量：1
8李亚.面向对象软件概要设计过程[J].福建电脑,2008(6):48-49. 被引量：7
9林洋港,陈恩红.文本分类中基于概率主题模型的噪声处理方法[J].计算机工程与科学,2010,32(7):89-92. 被引量：9
10卢小雷.【无拘无束】 Canon imageCLASS MF4420w激光多功能一体机[J].个人电脑,2012,18(5):24-24.

计算机应用研究

2016年第1期

浏览历史

内容加载中请稍等...

一种基于命名实体识别的需求跟踪方法被引量：1

参考文献25

二级参考文献26

共引文献40

同被引文献10

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

一种基于命名实体识别的需求跟踪方法 被引量：1

参考文献25

二级参考文献26

共引文献40

同被引文献10

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

一种基于命名实体识别的需求跟踪方法被引量：1