Group testing is a method that can be used to estimate the prevalence of rare infectious diseases,which can effectively save time and reduce costs compared to the method of random sampling.However,previous literature ...Group testing is a method that can be used to estimate the prevalence of rare infectious diseases,which can effectively save time and reduce costs compared to the method of random sampling.However,previous literature only demonstrated the optimality of group testing strategy while estimating prevalence under some strong assumptions.This article weakens the assumption of misclassification rate in the previous literature,considers the misclassification rate of the infected samples as a differentiable function of the pool size,and explores some optimal properties of group testing for estimating prevalence in the presence of differential misclassification conforming to this assumption.This article theoretically demonstrates that the group testing strategy performs better than the sample by sample procedure in estimating disease prevalence when the total number of sample pools is given or the size of the test population is determined.Numerical simulation experiments were conducted to evaluate the performance of group tests in estimating prevalence in the presence of dilution effect.展开更多
Distance-based regression model,as a nonparametric multivariate method,has been widely used to detect the association between variations in a distance or dissimilarity matrix for outcomes and predictor variables of in...Distance-based regression model,as a nonparametric multivariate method,has been widely used to detect the association between variations in a distance or dissimilarity matrix for outcomes and predictor variables of interest in genetic association studies,genomic analyses,and many other research areas.Based on it,a pseudo-F statistic which partitions the variation in distance matrices is often constructed to achieve the aim.To the best of our knowledge,the statistical properties of the pseudo-F statistic has not yet been well established in the literature.To fill this gap,the authors study the asymptotic null distribution of the pseudo-F statistic and show that it is asymptotically equivalent to a mixture of chi-squared random variables.Given that the pseudo-F test statistic has unsatisfactory power when the correlations of the response variables are large,the authors propose a square-root F-type test statistic which replaces the similarity matrix with its square root.The asymptotic null distribution of the new test statistic and power of both tests are also investigated.Simulation studies are conducted to validate the asymptotic distributions of the tests and demonstrate that the proposed test has more robust power than the pseudo-F test.Both test statistics are exemplified with a gene expression dataset for a prostate cancer pathway.展开更多
基金supported by the National Natural Science Foundation of China(Grant No.72091212).
文摘Group testing is a method that can be used to estimate the prevalence of rare infectious diseases,which can effectively save time and reduce costs compared to the method of random sampling.However,previous literature only demonstrated the optimality of group testing strategy while estimating prevalence under some strong assumptions.This article weakens the assumption of misclassification rate in the previous literature,considers the misclassification rate of the infected samples as a differentiable function of the pool size,and explores some optimal properties of group testing for estimating prevalence in the presence of differential misclassification conforming to this assumption.This article theoretically demonstrates that the group testing strategy performs better than the sample by sample procedure in estimating disease prevalence when the total number of sample pools is given or the size of the test population is determined.Numerical simulation experiments were conducted to evaluate the performance of group tests in estimating prevalence in the presence of dilution effect.
基金partially supported by Beijing Natural Science Foundation under Grant No.Z180006.
文摘Distance-based regression model,as a nonparametric multivariate method,has been widely used to detect the association between variations in a distance or dissimilarity matrix for outcomes and predictor variables of interest in genetic association studies,genomic analyses,and many other research areas.Based on it,a pseudo-F statistic which partitions the variation in distance matrices is often constructed to achieve the aim.To the best of our knowledge,the statistical properties of the pseudo-F statistic has not yet been well established in the literature.To fill this gap,the authors study the asymptotic null distribution of the pseudo-F statistic and show that it is asymptotically equivalent to a mixture of chi-squared random variables.Given that the pseudo-F test statistic has unsatisfactory power when the correlations of the response variables are large,the authors propose a square-root F-type test statistic which replaces the similarity matrix with its square root.The asymptotic null distribution of the new test statistic and power of both tests are also investigated.Simulation studies are conducted to validate the asymptotic distributions of the tests and demonstrate that the proposed test has more robust power than the pseudo-F test.Both test statistics are exemplified with a gene expression dataset for a prostate cancer pathway.