Analysis of multi-environment trials (METs) of crops for the evaluation and recommendation of varieties is an important issue in plant breeding research. Evaluating on the both stability of performance and high yiel...Analysis of multi-environment trials (METs) of crops for the evaluation and recommendation of varieties is an important issue in plant breeding research. Evaluating on the both stability of performance and high yield is essential in MET analyses. The objective of the present investigation was to compare 11 nonparametric stability statistics and apply nonparametric tests for genotype-by-environment interaction (GEI) to 14 maize (Zea mays L.) genotypes grown at 25 locations in southwestern China during 2005. Results of nonparametric tests of GEl and a combined ANOVA across locations showed that both crossover and noncrossover GEI, and genotypes varied highly significantly for yield. The results of principal component analysis, correlation analysis of nonparametric statistics, and yield indicated the nonparametric statistics grouped as four distinct classes that corresponded to different agronomic and biological concepts of stability. Furthermore, high values of TOP and low values of rank-sum were associated with high mean yield, but the other nonparametric statistics were not positively correlated with mean yield. Therefore, only rank-sum and TOP methods would be useful for simultaneously selection for high yield and stability. These two statistics recommended JY686 and HX168 as desirable and ND108, CM12, CN36, and NK6661 as undesirable genotypes.展开更多
In the multilevel thresholding segmentation of the image, the classification number is always given by the supervisor. To solve this problem, a fast multilevel thresholding algorithm considering both the threshold val...In the multilevel thresholding segmentation of the image, the classification number is always given by the supervisor. To solve this problem, a fast multilevel thresholding algorithm considering both the threshold value and the classification number is proposed based on the maximum entropy, and the self-adaptive criterion of the classification number is given. The algorithm can obtain thresholds and automatically decide the classification number. Experimental results show that the algorithm is effective.展开更多
Space-time disease cluster detection assists in conducting disease surveillance and implementing control strategies.The state-of-the-art method for this kind of problem is the Space-time Scan Statistics(SaTScan)which ...Space-time disease cluster detection assists in conducting disease surveillance and implementing control strategies.The state-of-the-art method for this kind of problem is the Space-time Scan Statistics(SaTScan)which has limitations for non-traditional/non-clinical data sources due to its parametric model assumptions such as Poisson orGaussian counts.Addressing this problem,an Eigenspace-based method called Multi-EigenSpot has recently been proposed as a nonparametric solution.However,it is based on the population counts data which are not always available in the least developed countries.In addition,the population counts are difficult to approximate for some surveillance data such as emergency department visits and over-the-counter drug sales,where the catchment area for each hospital/pharmacy is undefined.We extend the population-based Multi-EigenSpot method to approximate the potential disease clusters from the observed/reported disease counts only with no need for the population counts.The proposed adaptation uses an estimator of expected disease count that does not depend on the population counts.The proposed method was evaluated on the real-world dataset and the results were compared with the population-based methods:Multi-EigenSpot and SaTScan.The result shows that the proposed adaptation is effective in approximating the important outputs of the population-based methods.展开更多
To dates,most ship detection approaches for single-pol synthetic aperture radar(SAR) imagery try to ensure a constant false-alarm rate(CFAR).A high performance ship detector relies on two key components:an accura...To dates,most ship detection approaches for single-pol synthetic aperture radar(SAR) imagery try to ensure a constant false-alarm rate(CFAR).A high performance ship detector relies on two key components:an accurate estimation to a sea surface distribution and a fine designed CFAR algorithm.First,a novel nonparametric sea surface distribution estimation method is developed based on n-order Bézier curve.To estimate the sea surface distribution using n-order Bézier curve,an explicit analytical solution is derived based on a least square optimization,and the optimal selection also is presented to two essential parameters,the order n of Bézier curve and the number m of sample points.Next,to validate the ship detection performance of the estimated sea surface distribution,the estimated sea surface distribution by n-order Bézier curve is combined with a cell averaging CFAR(CA-CFAR).To eliminate the possible interfering ship targets in background window,an improved automatic censoring method is applied.Comprehensive experiments prove that in terms of sea surface estimation performance,the proposed method is as good as a traditional nonparametric Parzen window kernel method,and in most cases,outperforms two widely used parametric methods,K and G0 models.In terms of computation speed,a major advantage of the proposed estimation method is the time consuming only depended on the number m of sample points while independent of imagery size,which makes it can achieve a significant speed improvement to the Parzen window kernel method,and in some cases,it is even faster than two parametric methods.In terms of ship detection performance,the experiments show that the ship detector which constructed by the proposed sea surface distribution model and the given CA-CFAR algorithm has wide adaptability to different SAR sensors,resolutions and sea surface homogeneities and obtains a leading performance on the test dataset.展开更多
This paper presents a simple nonparametric regression approach to data-driven computing in elasticity. We apply the kernel regression to the material data set, and formulate a system of nonlinear equations solved to o...This paper presents a simple nonparametric regression approach to data-driven computing in elasticity. We apply the kernel regression to the material data set, and formulate a system of nonlinear equations solved to obtain a static equilibrium state of an elastic structure. Preliminary numerical experiments illustrate that, compared with existing methods, the proposed method finds a reasonable solution even if data points distribute coarsely in a given material data set.展开更多
Testing the equality of percentiles (quantiles) between populations is an effective method for robust, nonparametric comparison, especially when the distributions are asymmetric or irregularly shaped. Unlike global no...Testing the equality of percentiles (quantiles) between populations is an effective method for robust, nonparametric comparison, especially when the distributions are asymmetric or irregularly shaped. Unlike global nonparametric tests for homogeneity such as the Kolmogorv-Smirnov test, testing the equality of a set of percentiles (i.e., a percentile profile) yields an estimate of the location and extent of the differences between the populations along the entire domain. The Wald test using bootstrap estimates of variance of the order statistics provides a unified method for hypothesis testing of functions of the population percentiles. Simulation studies are conducted to show performance of the method under various scenarios and to give suggestions on its use. Several examples are given to illustrate some useful applications to real data.展开更多
Recent advances in high-throughput chromosome conformation capture(Hi-C)techniques have allowed us to map genome-wide chromatin interactions and uncover higher-order chromatin structures,thereby shedding light on the ...Recent advances in high-throughput chromosome conformation capture(Hi-C)techniques have allowed us to map genome-wide chromatin interactions and uncover higher-order chromatin structures,thereby shedding light on the principles of genome architecture and functions.However,statistical methods for detecting changes in large-scale chromatin organization such as topologically associating domains(TADs)are still lacking.Here,we proposed a new statistical method,DiffGR,for detecting differentially interacting genomic regions at the TAD level between Hi-C contact maps.We utilized the stratum-adjusted correlation coefficient to measure similarity of local TAD regions.We then developed a nonparametric approach to identify statistically significant changes of genomic interacting regions.Through simulation studies,we demonstrated that DiffGR can robustly and effectively discover differential genomic regions under various conditions.Furthermore,we successfully revealed cell type-specific changes in genomic interacting regions in both human and mouse Hi-C datasets,and illustrated that DiffGR yielded consistent and advantageous results compared with state-of-the-art differential TAD detection methods.The DiffGR R package is published under the GNU General Public License(GPL)≥2 license and is publicly available at https://github.com/wmalab/DiffGR.展开更多
Predicting the future course of an epidemic depends on being able to estimate the current numbers of infected individuals.However,while back-projection techniques allow reliable estimation of the numbers of infected i...Predicting the future course of an epidemic depends on being able to estimate the current numbers of infected individuals.However,while back-projection techniques allow reliable estimation of the numbers of infected individuals in the more distant past,they are less reliable in the recent past.We propose two new nonparametric methods to estimate the unobserved numbers of infected individuals in the recent past in an epidemic.The proposed methods are noniterative,easily computed and asymptotically normal with simple variance formulas.Simulations show that the proposed methods are much more robust and accurate than the existing back projection method,especially for the recent past,which is our primary interest.We apply the proposed methods to the 2003 Severe Acute Respiratory Syndorme(SARS) epidemic in Hong Kong.展开更多
基金the Program for Changjiang Scholars and Innovative Research Team in University,China(IRT0453)the Youth Foundation of Sichuan Province Office of Education(2006B005) of China,for supporting this research
文摘Analysis of multi-environment trials (METs) of crops for the evaluation and recommendation of varieties is an important issue in plant breeding research. Evaluating on the both stability of performance and high yield is essential in MET analyses. The objective of the present investigation was to compare 11 nonparametric stability statistics and apply nonparametric tests for genotype-by-environment interaction (GEI) to 14 maize (Zea mays L.) genotypes grown at 25 locations in southwestern China during 2005. Results of nonparametric tests of GEl and a combined ANOVA across locations showed that both crossover and noncrossover GEI, and genotypes varied highly significantly for yield. The results of principal component analysis, correlation analysis of nonparametric statistics, and yield indicated the nonparametric statistics grouped as four distinct classes that corresponded to different agronomic and biological concepts of stability. Furthermore, high values of TOP and low values of rank-sum were associated with high mean yield, but the other nonparametric statistics were not positively correlated with mean yield. Therefore, only rank-sum and TOP methods would be useful for simultaneously selection for high yield and stability. These two statistics recommended JY686 and HX168 as desirable and ND108, CM12, CN36, and NK6661 as undesirable genotypes.
文摘In the multilevel thresholding segmentation of the image, the classification number is always given by the supervisor. To solve this problem, a fast multilevel thresholding algorithm considering both the threshold value and the classification number is proposed based on the maximum entropy, and the self-adaptive criterion of the classification number is given. The algorithm can obtain thresholds and automatically decide the classification number. Experimental results show that the algorithm is effective.
基金This article was funded by a Fundamental Research Grant Scheme(FRGS)from the Ministry of Education,Malaysia(Ref:FRGS/1/2018/STG06/UTP/02/1)a Yayasan Universiti Teknologi PETRONAS-Fundamental Research Grant(cost center of 015LC0-013)received by Hanita Daud,URLs:https://www.mohe.gov.my/en/initiatives-2/187-program-utama/penyelidikan/548-research-grants-informationhttps://www.utp.edu.my/yayasan/Pages/default.aspx.
文摘Space-time disease cluster detection assists in conducting disease surveillance and implementing control strategies.The state-of-the-art method for this kind of problem is the Space-time Scan Statistics(SaTScan)which has limitations for non-traditional/non-clinical data sources due to its parametric model assumptions such as Poisson orGaussian counts.Addressing this problem,an Eigenspace-based method called Multi-EigenSpot has recently been proposed as a nonparametric solution.However,it is based on the population counts data which are not always available in the least developed countries.In addition,the population counts are difficult to approximate for some surveillance data such as emergency department visits and over-the-counter drug sales,where the catchment area for each hospital/pharmacy is undefined.We extend the population-based Multi-EigenSpot method to approximate the potential disease clusters from the observed/reported disease counts only with no need for the population counts.The proposed adaptation uses an estimator of expected disease count that does not depend on the population counts.The proposed method was evaluated on the real-world dataset and the results were compared with the population-based methods:Multi-EigenSpot and SaTScan.The result shows that the proposed adaptation is effective in approximating the important outputs of the population-based methods.
基金The National Natural Science Foundation of China under contract No.61471024the National Marine Technology Program for Public Welfare under contract No.201505002-1the Beijing Higher Education Young Elite Teacher Project under contract No.YETP0514
文摘To dates,most ship detection approaches for single-pol synthetic aperture radar(SAR) imagery try to ensure a constant false-alarm rate(CFAR).A high performance ship detector relies on two key components:an accurate estimation to a sea surface distribution and a fine designed CFAR algorithm.First,a novel nonparametric sea surface distribution estimation method is developed based on n-order Bézier curve.To estimate the sea surface distribution using n-order Bézier curve,an explicit analytical solution is derived based on a least square optimization,and the optimal selection also is presented to two essential parameters,the order n of Bézier curve and the number m of sample points.Next,to validate the ship detection performance of the estimated sea surface distribution,the estimated sea surface distribution by n-order Bézier curve is combined with a cell averaging CFAR(CA-CFAR).To eliminate the possible interfering ship targets in background window,an improved automatic censoring method is applied.Comprehensive experiments prove that in terms of sea surface estimation performance,the proposed method is as good as a traditional nonparametric Parzen window kernel method,and in most cases,outperforms two widely used parametric methods,K and G0 models.In terms of computation speed,a major advantage of the proposed estimation method is the time consuming only depended on the number m of sample points while independent of imagery size,which makes it can achieve a significant speed improvement to the Parzen window kernel method,and in some cases,it is even faster than two parametric methods.In terms of ship detection performance,the experiments show that the ship detector which constructed by the proposed sea surface distribution model and the given CA-CFAR algorithm has wide adaptability to different SAR sensors,resolutions and sea surface homogeneities and obtains a leading performance on the test dataset.
基金supported by JSPS KAKENHI (Grants 17K06633 and 18K18898)
文摘This paper presents a simple nonparametric regression approach to data-driven computing in elasticity. We apply the kernel regression to the material data set, and formulate a system of nonlinear equations solved to obtain a static equilibrium state of an elastic structure. Preliminary numerical experiments illustrate that, compared with existing methods, the proposed method finds a reasonable solution even if data points distribute coarsely in a given material data set.
文摘Testing the equality of percentiles (quantiles) between populations is an effective method for robust, nonparametric comparison, especially when the distributions are asymmetric or irregularly shaped. Unlike global nonparametric tests for homogeneity such as the Kolmogorv-Smirnov test, testing the equality of a set of percentiles (i.e., a percentile profile) yields an estimate of the location and extent of the differences between the populations along the entire domain. The Wald test using bootstrap estimates of variance of the order statistics provides a unified method for hypothesis testing of functions of the population percentiles. Simulation studies are conducted to show performance of the method under various scenarios and to give suggestions on its use. Several examples are given to illustrate some useful applications to real data.
基金supported by the National Science Foundation,USA(Grant No.DBI-1751317)the National Institute of Health,USA(Grant No.R35GM133678).
文摘Recent advances in high-throughput chromosome conformation capture(Hi-C)techniques have allowed us to map genome-wide chromatin interactions and uncover higher-order chromatin structures,thereby shedding light on the principles of genome architecture and functions.However,statistical methods for detecting changes in large-scale chromatin organization such as topologically associating domains(TADs)are still lacking.Here,we proposed a new statistical method,DiffGR,for detecting differentially interacting genomic regions at the TAD level between Hi-C contact maps.We utilized the stratum-adjusted correlation coefficient to measure similarity of local TAD regions.We then developed a nonparametric approach to identify statistically significant changes of genomic interacting regions.Through simulation studies,we demonstrated that DiffGR can robustly and effectively discover differential genomic regions under various conditions.Furthermore,we successfully revealed cell type-specific changes in genomic interacting regions in both human and mouse Hi-C datasets,and illustrated that DiffGR yielded consistent and advantageous results compared with state-of-the-art differential TAD detection methods.The DiffGR R package is published under the GNU General Public License(GPL)≥2 license and is publicly available at https://github.com/wmalab/DiffGR.
基金supported in part by National Natural Science Foundation of China(Grant Nos. 10771148,11071197)supported by an RGC grant,the Chief Executive Community Project and Hong Kong Jockey Club Charities Trust
文摘Predicting the future course of an epidemic depends on being able to estimate the current numbers of infected individuals.However,while back-projection techniques allow reliable estimation of the numbers of infected individuals in the more distant past,they are less reliable in the recent past.We propose two new nonparametric methods to estimate the unobserved numbers of infected individuals in the recent past in an epidemic.The proposed methods are noniterative,easily computed and asymptotically normal with simple variance formulas.Simulations show that the proposed methods are much more robust and accurate than the existing back projection method,especially for the recent past,which is our primary interest.We apply the proposed methods to the 2003 Severe Acute Respiratory Syndorme(SARS) epidemic in Hong Kong.