Abstract: In software testing, the quality of test cases is crucial, but generating them manually is time-consuming. Many automatic test case generation methods exist, and choosing among them requires careful consideration of the program's features. Current evaluation methods compare only a limited set of metrics; they neither scale to a larger number of metrics nor account for the relative importance of each metric to the final assessment. To address this, we propose an evaluation tool, the Test Case Generation Evaluator (TCGE), based on learning to rank (L2R). Unlike previous approaches, our method evaluates algorithms comprehensively by considering multiple metrics, resulting in a more reasoned assessment. The main principle of the TCGE is to build feature vectors from the features that concern the tester. Through training, the feature vectors are sorted into a list in which the order of the methods reflects their effectiveness on the tested assembly. We implement TCGE using three L2R algorithms: ListNet, LambdaMART, and RFLambdaMART. The evaluation uses a dataset containing features of classical test case generation algorithms and three metrics: Normalized Discounted Cumulative Gain (NDCG), Mean Average Precision (MAP), and Mean Reciprocal Rank (MRR). The results demonstrate that the TCGE is more effective at evaluating test case generation algorithms than other methods. Among the three L2R algorithms, RFLambdaMART proves the most effective, achieving an accuracy above 96.5%, surpassing LambdaMART by 2% and ListNet by 1.5%. Consequently, the TCGE framework has significant application value in the evaluation of test case generation algorithms.
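The three ranking metrics named in this abstract can be illustrated with a minimal sketch. It assumes graded relevance labels listed in predicted rank order, the common exponential gain 2^rel - 1 for NDCG, and that any label above zero counts as relevant for MAP and MRR. The function names are illustrative and are not taken from the TCGE implementation.

```python
import math

def dcg(relevances):
    """Discounted cumulative gain of relevance labels given in ranked order."""
    return sum((2 ** rel - 1) / math.log2(pos + 2)
               for pos, rel in enumerate(relevances))

def ndcg(relevances):
    """DCG divided by the DCG of the ideal (descending) ordering."""
    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal > 0 else 0.0

def average_precision(relevances):
    """Mean of precision@k over the positions k that hold a relevant item."""
    hits, total = 0, 0.0
    for rank, rel in enumerate(relevances, start=1):
        if rel > 0:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0

def reciprocal_rank(relevances):
    """1 / rank of the first relevant item, or 0.0 if none is relevant."""
    for rank, rel in enumerate(relevances, start=1):
        if rel > 0:
            return 1.0 / rank
    return 0.0

def mean_metric(metric, rankings):
    """Average a per-ranking metric over several rankings (gives MAP or MRR)."""
    return sum(metric(r) for r in rankings) / len(rankings)

if __name__ == "__main__":
    # Hypothetical relevance labels for two ranked lists of generation methods.
    rankings = [[3, 2, 0, 1], [0, 1, 2]]
    print("NDCG:", [round(ndcg(r), 4) for r in rankings])
    print("MAP :", round(mean_metric(average_precision, rankings), 4))
    print("MRR :", round(mean_metric(reciprocal_rank, rankings), 4))
```

MAP and MRR are simply the per-ranking values averaged over all ranked lists, which is why a single `mean_metric` helper covers both.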
Funding: Supported by a grant from the National Basic Research Program of China (No. 2011CB503804).
Abstract: As a nonparametric method, the Kruskal-Wallis test is widely used to compare three or more independent groups when ordinal or interval-level data are available, especially when the assumptions of analysis of variance (ANOVA) are not met. If the Kruskal-Wallis statistic is statistically significant, the Nemenyi test is an alternative method for further pairwise multiple comparisons to locate the source of significance. Unfortunately, most popular statistical packages do not integrate the Nemenyi test, which is not easy to calculate by hand. We described the theory and applications of the Kruskal-Wallis and Nemenyi tests, and presented a flexible SAS macro to implement the two tests. The SAS macro was demonstrated by two examples from our cohort study in occupational epidemiology. It provides a useful tool for SAS users to test the differences among three or more independent groups using a nonparametric method.
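To make the two-stage procedure concrete, here is a minimal Python sketch (not the authors' SAS macro). It computes the Kruskal-Wallis H statistic with the usual tie correction, then a pairwise statistic in the chi-square form often associated with the Nemenyi test. That pairwise form, its omission of a tie correction, and all function names are assumptions made for illustration.

```python
from itertools import combinations

def midranks(values):
    """Rank values 1..N, giving tied values the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def kruskal_wallis(groups):
    """Return (tie-corrected H, list of mean ranks per group)."""
    pooled = [x for g in groups for x in g]
    n = len(pooled)
    ranks = midranks(pooled)
    sums, start = [], 0
    for g in groups:
        sums.append(sum(ranks[start:start + len(g)]))
        start += len(g)
    h = 12 / (n * (n + 1)) * sum(s * s / len(g) for s, g in zip(sums, groups)) - 3 * (n + 1)
    counts = {}
    for x in pooled:
        counts[x] = counts.get(x, 0) + 1
    correction = 1 - sum(t ** 3 - t for t in counts.values()) / (n ** 3 - n)
    mean_ranks = [s / len(g) for s, g in zip(sums, groups)]
    return (h / correction if correction > 0 else 0.0), mean_ranks

def pairwise_chi2(groups, mean_ranks):
    """Assumed chi-square form: (Ri - Rj)^2 / [N(N+1)/12 * (1/ni + 1/nj)]."""
    n = sum(len(g) for g in groups)
    stats = {}
    for i, j in combinations(range(len(groups)), 2):
        se2 = n * (n + 1) / 12 * (1 / len(groups[i]) + 1 / len(groups[j]))
        stats[(i, j)] = (mean_ranks[i] - mean_ranks[j]) ** 2 / se2
    return stats

if __name__ == "__main__":
    groups = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]  # hypothetical data
    h, mean_ranks = kruskal_wallis(groups)
    print("H =", h, "mean ranks =", mean_ranks)
    print(pairwise_chi2(groups, mean_ranks))
```

Under this form, both H and each pairwise statistic would be compared with a chi-square critical value on k - 1 degrees of freedom (5.991 for three groups at α = 0.05).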
Abstract: A maximum test is proposed in lieu of forcing a choice between the dependent-samples t-test and the Wilcoxon signed-ranks test. The maximum test, which requires a new table of critical values, maintains nominal α while guaranteeing the maximum power of the two constituent tests. Its critical values, obtained via Monte Carlo methods, are uniformly smaller than those of the Bonferroni-Dunn adjustment, giving it superior power when testing for a treatment alternative of shift in the location parameter when data are sampled from non-normal distributions.
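The idea of a Monte Carlo critical value for a maximum test can be sketched as follows. The abstract does not give the exact test statistic, so this sketch assumes one plausible form: the larger of |t| from the paired t-test and |z| from the normal approximation to the Wilcoxon signed-rank statistic. It also assumes continuous differences with no ties or zero variance, and all function names are hypothetical.

```python
import math
import random
import statistics

def paired_t(diffs):
    """Paired t statistic computed from the within-pair differences."""
    n = len(diffs)
    return statistics.mean(diffs) / (statistics.stdev(diffs) / math.sqrt(n))

def wilcoxon_z(diffs):
    """Normal approximation to the signed-rank statistic (assumes no ties)."""
    nonzero = sorted((d for d in diffs if d != 0), key=abs)
    n = len(nonzero)
    w_plus = sum(rank for rank, d in enumerate(nonzero, start=1) if d > 0)
    mean = n * (n + 1) / 4
    var = n * (n + 1) * (2 * n + 1) / 24
    return (w_plus - mean) / math.sqrt(var)

def max_stat(diffs):
    """Assumed maximum-test statistic: the larger of |t| and |z|."""
    return max(abs(paired_t(diffs)), abs(wilcoxon_z(diffs)))

def mc_critical_value(n, alpha=0.05, reps=5000, seed=0):
    """Empirical (1 - alpha) quantile of max_stat under a standard normal null."""
    rng = random.Random(seed)
    sims = sorted(max_stat([rng.gauss(0, 1) for _ in range(n)])
                  for _ in range(reps))
    return sims[math.ceil((1 - alpha) * reps) - 1]

if __name__ == "__main__":
    diffs = [1.2, -0.4, 2.1, 0.8, 1.5, -0.2, 0.9, 1.1]  # hypothetical data
    cv = mc_critical_value(len(diffs))
    print("statistic:", max_stat(diffs), "critical value:", cv)
```

Because the critical value comes from the simulated null distribution of the maximum itself, it accounts for the correlation between the two constituent statistics, which a Bonferroni-style adjustment does not use.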
Abstract: We propose a new nonparametric test, based on the rank differences within the paired sample, for testing the equality of the marginal distributions of a bivariate distribution. We also consider a modification of this test based on the test proposed by Baumgartner, Weiß, and Schindler (1998). An extensive numerical power comparison of various parametric and nonparametric tests was conducted under a wide range of bivariate distributions for small sample sizes. The two new nonparametric tests have power comparable to the paired t test for data simulated from bivariate normal distributions, and are generally more powerful than the paired t test and other commonly used nonparametric tests for several important bivariate distributions.
Abstract: This paper concerns the log-rank test for comparing survival curves of groups defined by neonatal mortality characteristics in River Nile State, Sudan. Kaplan-Meier methods are first used to estimate and graph survival curves for the variables of interest (sex of newborn, weight of newborn, gestational age, mode of delivery, and residence type), and the log-rank test is then used to compare two or more survival curves for the newborn characteristics associated with newborn death. The data come from hospitals in River Nile State, Sudan, with a sample of 700 newborns admitted to the Neonatal Intensive Care Units (NICUs) of those hospitals during the period 2018-2020. In terms of risk of death, we found that 25% of the sampled newborns born in River Nile State, Sudan, died. In addition, after applying the log-rank statistics and Kaplan-Meier methods, we conclude that gender does not affect a newborn's survival, while the risk of survival increases when the birth weight is greater than 4.35 kg and the gestational age is greater than 42 weeks. There is no difference in the probability of survival between newborns delivered normally and those delivered by cesarean section. However, newborns are significantly more likely to survive in urban areas than in rural areas.
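The two techniques named in this abstract can be sketched in a few lines of Python. This is a minimal illustration with hypothetical data, not the study's analysis: `kaplan_meier` returns the product-limit survival estimate at each event time, and `logrank_chi2` returns the two-group log-rank chi-square statistic (1 degree of freedom). Event indicators are assumed to be 1 for death and 0 for censoring.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time where at least one death occurs."""
    data = sorted(zip(times, events))
    at_risk = len(data)
    surv, curve = 1.0, []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = sum(1 for tt, e in data if tt == t and e == 1)
        leaving = sum(1 for tt, _ in data if tt == t)  # deaths plus censored at t
        if deaths:
            surv *= 1 - deaths / at_risk
            curve.append((t, surv))
        at_risk -= leaving
        i += leaving
    return curve

def logrank_chi2(times_a, events_a, times_b, events_b):
    """Two-group log-rank statistic: (sum of O - E)^2 / summed hypergeometric variance."""
    all_times = list(times_a) + list(times_b)
    all_events = list(events_a) + list(events_b)
    event_times = sorted({t for t, e in zip(all_times, all_events) if e == 1})
    o_minus_e, var = 0.0, 0.0
    for t in event_times:
        n_a = sum(1 for x in times_a if x >= t)  # at risk in group A just before t
        n_b = sum(1 for x in times_b if x >= t)
        d_a = sum(1 for x, e in zip(times_a, events_a) if x == t and e == 1)
        d_b = sum(1 for x, e in zip(times_b, events_b) if x == t and e == 1)
        n, d = n_a + n_b, d_a + d_b
        o_minus_e += d_a - d * n_a / n
        if n > 1:
            var += d * (n_a / n) * (n_b / n) * (n - d) / (n - 1)
    return o_minus_e ** 2 / var if var > 0 else 0.0

if __name__ == "__main__":
    # Hypothetical follow-up times in days; 1 = died, 0 = censored.
    print(kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1]))
    print(logrank_chi2([1, 2, 3], [1, 1, 1], [4, 5, 6], [1, 1, 1]))
```

With two groups, the statistic would be compared with the chi-square critical value on 1 degree of freedom (3.841 at α = 0.05).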