Cyclomatic Complexity and Lines of Code: Empirical Evidence of a Stable Linear Relationship 被引量：1

Cyclomatic Complexity and Lines of Code: Empirical Evidence of a Stable Linear Relationship

下载PDF

导出

摘要 Researchers have often commented on the high correlation between McCabe’s Cyclomatic Complexity (CC) and lines of code (LOC). Many have believed this correlation high enough to justify adjusting CC by LOC or even substituting LOC for CC. However, from an empirical standpoint the relationship of CC to LOC is still an open one. We undertake the largest statistical study of this relationship to date. Employing modern regression techniques, we find the linearity of this relationship has been severely underestimated, so much so that CC can be said to have absolutely no explana-tory power of its own. This research presents evidence that LOC and CC have a stable practically perfect linear rela-tionship that holds across programmers, languages, code paradigms (procedural versus object-oriented), and software processes. Linear models are developed relating LOC and CC. These models are verified against over 1.2 million randomly selected source files from the SourceForge code repository. These files represent software projects from three target languages (C, C++, and Java) and a variety of programmer experience levels, software architectures, and de-velopment methodologies. The models developed are found to successfully predict roughly 90% of CC’s variance by LOC alone. This suggest not only that the linear relationship between LOC and CC is stable, but the aspects of code complexity that CC measures, such as the size of the test case space, grow linearly with source code size across lan-guages and programming paradigms. Researchers have often commented on the high correlation between McCabe’s Cyclomatic Complexity (CC) and lines of code (LOC). Many have believed this correlation high enough to justify adjusting CC by LOC or even substituting LOC for CC. However, from an empirical standpoint the relationship of CC to LOC is still an open one. We undertake the largest statistical study of this relationship to date. Employing modern regression techniques, we find the linearity of this relationship has been severely underestimated, so much so that CC can be said to have absolutely no explana-tory power of its own. This research presents evidence that LOC and CC have a stable practically perfect linear rela-tionship that holds across programmers, languages, code paradigms (procedural versus object-oriented), and software processes. Linear models are developed relating LOC and CC. These models are verified against over 1.2 million randomly selected source files from the SourceForge code repository. These files represent software projects from three target languages (C, C++, and Java) and a variety of programmer experience levels, software architectures, and de-velopment methodologies. The models developed are found to successfully predict roughly 90% of CC’s variance by LOC alone. This suggest not only that the linear relationship between LOC and CC is stable, but the aspects of code complexity that CC measures, such as the size of the test case space, grow linearly with source code size across lan-guages and programming paradigms.

作者 Graylin JAY Joanne E. HALE Randy K. SMITH David HALE Nicholas A. KRAFT Charles WARD

机构地区不详

出处《Journal of Software Engineering and Applications》 2009年第3期137-143,共7页 软件工程与应用（英文）

关键词 SOFTWARE COMPLEXITY SOFTWARE Metrics Cyclomatic COMPLEXITY Software Complexity Software Metrics Cyclomatic Complexity

分类号 R73 [医药卫生—肿瘤]

引文网络
相关文献

同被引文献4

1陈诚吴逵 Venkatesh Srinivasan Kesav Bharadwaj R.The Best Answers？ Think Twice： Identifying Commercial Campagins in the CQA Forums[J].Journal of Computer Science & Technology,2015,30(4):810-828. 被引量：1
2张芸,David Lo,夏鑫,孙建伶.Multi-Factor Duplicate Question Detection in Stack Overflow[J].Journal of Computer Science & Technology,2015,30(5):981-997. 被引量：5
3Xin-Li Yang,David Lo,Xin Xia,Zhi-Yuan Wan,Jian-Ling Sun.What Security Questions Do Developers Ask？ A Large-Scale Study of Stack Overflow Posts[J].Journal of Computer Science & Technology,2016,31(5):910-924. 被引量：9
4Xianzhi Wang,Chaoran Huang,Lina Yao,Boualem Benatallah,Manqing Dong.A Survey on Expert Recommendation in Community Question Answering[J].Journal of Computer Science & Technology,2018,33(4):625-653. 被引量：13

引证文献1

1Yi-Xuan Tang,Zhi-Lei Ren,He Jiang,Xiao-Chen Li,Wei-Qiang Kong.An Empirical Comparison Between Tutorials and Crowd Documentation of Application Programming Interface[J].Journal of Computer Science & Technology,2021,36(4):856-876.

1Payel Bajpayee,Hassan Reza.Toward Quality Attribute Driven Approach to Software Architectural Design[J].Journal of Software Engineering and Applications,2017,10(6):483-499.
2Fengtian Sun.A Diachronic Study on Translation Strategies of Culturespecific Items With the Translation of Measurement Unit in Howard GoldBlatt’s Works as An Example[J].Journal of Contemporary Educational Research,2018,2(6):17-21.
3Aicha Choutri,Faiza Belala,Kamel Barkaoui.A Tile Logic Based Approach for Software Architecture Description Analysis[J].Journal of Software Engineering and Applications,2010,3(11):1067-1079.

Journal of Software Engineering and Applications

2009年第3期

浏览历史

内容加载中请稍等...

Cyclomatic Complexity and Lines of Code: Empirical Evidence of a Stable Linear Relationship 被引量：1

同被引文献4

引证文献1

相关作者

相关机构

相关主题

浏览历史