期刊文献+
共找到55,279篇文章
< 1 2 250 >
每页显示 20 50 100
A Simple Chi-Square Statistic for Testing Homogeneity of Zero-Inflated Distributions
1
作者 William D. Johnson Jeffrey H. Burton +1 位作者 Robbie A. Beyl Jacob E. Romer 《Open Journal of Statistics》 2015年第6期483-493,共11页
Zero-inflated distributions are common in statistical problems where there is interest in testing homogeneity of two or more independent groups. Often, the underlying distribution that has an inflated number of zero-v... Zero-inflated distributions are common in statistical problems where there is interest in testing homogeneity of two or more independent groups. Often, the underlying distribution that has an inflated number of zero-valued observations is asymmetric, and its functional form may not be known or easily characterized. In this case, comparisons of the groups in terms of their respective percentiles may be appropriate as these estimates are nonparametric and more robust to outliers and other irregularities. The median test is often used to compare distributions with similar but asymmetric shapes but may be uninformative when there are excess zeros or dissimilar shapes. For zero-inflated distributions, it is useful to compare the distributions with respect to their proportion of zeros, coupled with the comparison of percentile profiles for the observed non-zero values. A simple chi-square test for simultaneous testing of these two components is proposed, applicable to both continuous and discrete data. Results of simulation studies are reported to summarize empirical power under several scenarios. We give recommendations for the minimum sample size which is necessary to achieve suitable test performance in specific examples. 展开更多
关键词 Asymptotic chi-square TEST EQUALITY of QUANTILES Large Sample TEST Nonparametric TEST Percentile Profiles ZERO-INFLATED DISTRIBUTIONS
下载PDF
Lottery Numbers and Ordered Statistics
2
作者 Kung-Kuen Tse 《Applied Mathematics》 2024年第4期287-291,共5页
The lottery has long captivated the imagination of players worldwide, offering the tantalizing possibility of life-changing wins. While winning the lottery is largely a matter of chance, as lottery drawings are typica... The lottery has long captivated the imagination of players worldwide, offering the tantalizing possibility of life-changing wins. While winning the lottery is largely a matter of chance, as lottery drawings are typically random and unpredictable. Some people use the lottery terminal randomly generates numbers for them, some players choose numbers that hold personal significance to them, such as birthdays, anniversaries, or other important dates, some enthusiasts have turned to statistical analysis as a means to analyze past winning numbers identify patterns or frequencies. In this paper, we use order statistics to estimate the probability of specific order of numbers or number combinations being drawn in future drawings. 展开更多
关键词 LOTTERY Order statistics Hypergeometric Distribution EXPECTATION UNIFORM
下载PDF
Efficient ECG classification based on Chi-square distance for arrhythmia detection
3
作者 Dhiah Al-Shammary Mustafa Noaman Kadhim +2 位作者 Ahmed M.Mahdi Ayman Ibaida Khandakar Ahmedb 《Journal of Electronic Science and Technology》 EI CAS CSCD 2024年第2期1-15,共15页
This study introduces a new classifier tailored to address the limitations inherent in conventional classifiers such as K-nearest neighbor(KNN),random forest(RF),decision tree(DT),and support vector machine(SVM)for ar... This study introduces a new classifier tailored to address the limitations inherent in conventional classifiers such as K-nearest neighbor(KNN),random forest(RF),decision tree(DT),and support vector machine(SVM)for arrhythmia detection.The proposed classifier leverages the Chi-square distance as a primary metric,providing a specialized and original approach for precise arrhythmia detection.To optimize feature selection and refine the classifier’s performance,particle swarm optimization(PSO)is integrated with the Chi-square distance as a fitness function.This synergistic integration enhances the classifier’s capabilities,resulting in a substantial improvement in accuracy for arrhythmia detection.Experimental results demonstrate the efficacy of the proposed method,achieving a noteworthy accuracy rate of 98% with PSO,higher than 89% achieved without any previous optimization.The classifier outperforms machine learning(ML)and deep learning(DL)techniques,underscoring its reliability and superiority in the realm of arrhythmia classification.The promising results render it an effective method to support both academic and medical communities,offering an advanced and precise solution for arrhythmia detection in electrocardiogram(ECG)data. 展开更多
关键词 Arrhythmia classification chi-square distance Electrocardiogram(ECG)signal Particle swarm optimization(PSO)
下载PDF
Evaluation of Serum Anti-Müllerian Hormone (AMH) Values for 28,016 Bulgarian Women: Prognostic Statistical Model of Age Specific AMH Declining
4
作者 Martin Vladimirov Evan Gatev +6 位作者 Desislava Tacheva Aleksandra Kalacheva Milena Bojilova Serpil Izet Alexander Angelov Nedyalko Kalatchev Iavor K. Vladimirov 《Open Journal of Obstetrics and Gynecology》 2024年第5期651-673,共23页
The present study aims to establish a relationship between serum AMH levels and age in a large group of women living in Bulgaria, as well as to establish reference age-specific AMH levels in women that would serve as ... The present study aims to establish a relationship between serum AMH levels and age in a large group of women living in Bulgaria, as well as to establish reference age-specific AMH levels in women that would serve as an initial estimate of ovarian age. A total of 28,016 women on the territory of the Republic of Bulgaria were tested for serum AMH levels with a median age of 37.0 years (interquartile range 32.0 to 41.0). For women aged 20 - 29 years, the Bulgarian population has relatively high median levels of AMH, similar to women of Asian origin. For women aged 30 - 34 years, our results are comparable to those of women living in Western Europe. For women aged 35 - 39 years, our results are comparable to those of women living in the territory of India and Kenya. For women aged 40 - 44 years, our results were lower than those for women from the Western European and Chinese populations, close to the Indian and higher than Korean and Kenya populations, respectively. Our results for women of Bulgarian origin are also comparable to US Latina women at age 30, 35 and 40 ages. On the base on constructed a statistical model to predicting the decline in AMH levels at different ages, we found non-linear structure of AMH decline for the low AMH 3.5) the dependence of the decline of AMH on age was confirmed as linear. In conclusion, we evaluated the serum level of AMH in Bulgarian women and established age-specific AMH percentile reference values based on a large representative sample. We have developed a prognostic statistical model that can facilitate the application of AMH in clinical practice and the prediction of reproductive capacity and population health. 展开更多
关键词 Anti-Müllerian Hormone Women Age Ovarian Response ETHNICITY Prognostic statistical Model
下载PDF
Rapid Prediction of Wastewater Index Using CNN Architecture and PLS Series Statistical Methods
5
作者 Qiushuang Mo Lili Xu +2 位作者 Fangxiu Meng Shaoyong Hong Xuemei Lin 《Open Journal of Statistics》 2024年第3期243-258,共16页
Chemical oxygen demand (COD) is an important index to measure the degree of water pollution. In this paper, near-infrared technology is used to obtain 148 wastewater spectra to predict the COD value in wastewater. Fir... Chemical oxygen demand (COD) is an important index to measure the degree of water pollution. In this paper, near-infrared technology is used to obtain 148 wastewater spectra to predict the COD value in wastewater. First, the partial least squares regression (PLS) model was used as the basic model. Monte Carlo cross-validation (MCCV) was used to select 25 samples out of 148 samples that did not conform to conventional statistics. Then, the interval partial least squares (iPLS) regression modeling was carried out on 123 samples, and the spectral bands were divided into 40 subintervals. The optimal subintervals are 20 and 26, and the optimal correlation coefficient of the test set (RT) is 0.58. Further, the waveband is divided into five intervals: 17, 19, 20, 22 and 26. When the number of joint intervals under each interval is three, the optimal RT is 0.71. When the number of joint subintervals is four, the optimal RT is 0.79. Finally, convolutional neural network (CNN) was used for quantitative prediction, and RT was 0.9. The results show that CNN can automatically screen the features inside the data, and the quantitative prediction effect is better than that of iPLS and synergy interval partial least squares model (SiPLS) with joint subinterval three and four, indicating that CNN can be used for quantitative analysis of water pollution degree. 展开更多
关键词 WASTEWATER Near-Infrared Spectroscopy Chemistry Oxygen Demand Partial Least Squares Convolutional Neural Network statistical Optimization
下载PDF
Effect Evaluation of CBL Combined with Rain Classroom Teaching Method in Medical Statistics
6
作者 Man Luo Xiaofang Zhang Wei Liu 《Open Journal of Applied Sciences》 2024年第5期1204-1213,共10页
Objective: To explore the application effect of CBL combined with rain classroom teaching method in medical statistics courses. Methods: The undergraduate students of medical imaging technology in 2019 and 2020 in a u... Objective: To explore the application effect of CBL combined with rain classroom teaching method in medical statistics courses. Methods: The undergraduate students of medical imaging technology in 2019 and 2020 in a university were selected as the research objects. A cluster sampling method was used to select 79 undergraduate students from 2019 in the control group and 75 undergraduate students from 2020 in the experimental group. Traditional teaching method and CBL combined with rain classroom teaching method was used in the control group and experimental group respectively. The final examination scores of the two groups were compared. In experimental group, the correlation between the average score in the rain classroom and the final examination score was tested, and the teaching effect was evaluated. Results: The average score of final examination in experimental group and control group was 79.13 ± 10.32 points and 71.54 ± 14.752 points, respectively, which had a statistically significant difference (Z = 2.586, P = 0.012);the final examination scores of the students in the experimental group were positively correlated with the average scores of the rain classroom (r = 0.372, P = 0.001), and the proportion of satisfaction in the experimental group was 94.7%. Conclusion: The CBL combined with rain classroom teaching method can improve the teaching effectiveness of medical statistics courses. 展开更多
关键词 Rain Classroom CBL Medical statistics
下载PDF
Smart Healthcare Activity Recognition Using Statistical Regression and Intelligent Learning
7
作者 K.Akilandeswari Nithya Rekha Sivakumar +2 位作者 Hend Khalid Alkahtani Shakila Basheer Sara Abdelwahab Ghorashi 《Computers, Materials & Continua》 SCIE EI 2024年第1期1189-1205,共17页
In this present time,Human Activity Recognition(HAR)has been of considerable aid in the case of health monitoring and recovery.The exploitation of machine learning with an intelligent agent in the area of health infor... In this present time,Human Activity Recognition(HAR)has been of considerable aid in the case of health monitoring and recovery.The exploitation of machine learning with an intelligent agent in the area of health informatics gathered using HAR augments the decision-making quality and significance.Although many research works conducted on Smart Healthcare Monitoring,there remain a certain number of pitfalls such as time,overhead,and falsification involved during analysis.Therefore,this paper proposes a Statistical Partial Regression and Support Vector Intelligent Agent Learning(SPR-SVIAL)for Smart Healthcare Monitoring.At first,the Statistical Partial Regression Feature Extraction model is used for data preprocessing along with the dimensionality-reduced features extraction process.Here,the input dataset the continuous beat-to-beat heart data,triaxial accelerometer data,and psychological characteristics were acquired from IoT wearable devices.To attain highly accurate Smart Healthcare Monitoring with less time,Partial Least Square helps extract the dimensionality-reduced features.After that,with these resulting features,SVIAL is proposed for Smart Healthcare Monitoring with the help of Machine Learning and Intelligent Agents to minimize both analysis falsification and overhead.Experimental evaluation is carried out for factors such as time,overhead,and false positive rate accuracy concerning several instances.The quantitatively analyzed results indicate the better performance of our proposed SPR-SVIAL method when compared with two state-of-the-art methods. 展开更多
关键词 Internet of Things smart health care monitoring human activity recognition intelligent agent learning statistical partial regression support vector
下载PDF
Comparative Analysis of Statistical Thickness Models for the Determination of the External Specific Surface and the Surface of the Micropores of Materials: The Case of a Clay Concrete Stabilized Using Sugar Cane Molasses
8
作者 Nice Mfoutou Ngouallat Narcisse Malanda +3 位作者 Christ Ariel Ceti Malanda Kris Berjovie Maniongui Erman Eloge Nzaba Madila Paul Louzolo-Kimbembe 《Geomaterials》 2024年第2期13-28,共16页
In this work, four empirical models of statistical thickness, namely the models of Harkins and Jura, Hasley, Carbon Black and Jaroniec, were compared in order to determine the textural properties (external surface and... In this work, four empirical models of statistical thickness, namely the models of Harkins and Jura, Hasley, Carbon Black and Jaroniec, were compared in order to determine the textural properties (external surface and surface of micropores) of a clay concrete without molasses and clay concretes stabilized with 8%, 12% and 16% molasses. The results obtained show that Hasley’s model can be used to obtain the external surfaces. However, it does not allow the surface of the micropores to be obtained, and is not suitable for the case of simple clay concrete (without molasses) and for clay concretes stabilized with molasses. The Carbon Black, Jaroniec and Harkins and Jura models can be used for clay concrete and stabilized clay concrete. However, the Carbon Black model is the most relevant for clay concrete and the Harkins and Jura model is for molasses-stabilized clay concrete. These last two models augur well for future research. 展开更多
关键词 statistical Thickness Model External Specific Surface Microporous Surface Clay Concrete MOLASSES
下载PDF
Study on Key Biological Indicators of Diabetes Based on Statistical Tests
9
作者 Shuaibin Yang 《Journal of Clinical and Nursing Research》 2024年第7期267-273,共7页
Normality testing is a fundamental hypothesis test in the statistical analysis of key biological indicators of diabetes.If this assumption is violated,it may cause the test results to deviate from the true value,leadi... Normality testing is a fundamental hypothesis test in the statistical analysis of key biological indicators of diabetes.If this assumption is violated,it may cause the test results to deviate from the true value,leading to incorrect inferences and conclusions,and ultimately affecting the validity and accuracy of statistical inferences.Considering this,the study designs a unified analysis scheme for different data types based on parametric statistical test methods and non-parametric test methods.The data were grouped according to sample type and divided into discrete data and continuous data.To account for differences among subgroups,the conventional chi-squared test was used for discrete data.The normal distribution is the basis of many statistical methods;if the data does not follow a normal distribution,many statistical methods will fail or produce incorrect results.Therefore,before data analysis and modeling,the data were divided into normal and non-normal groups through normality testing.For normally distributed data,parametric statistical methods were used to judge the differences between groups.For non-normal data,non-parametric tests were employed to improve the accuracy of the analysis.Statistically significant indicators were retained according to the significance index P-value of the statistical test or corresponding statistics.These indicators were then combined with relevant medical background to further explore the etiology leading to the occurrence or transformation of diabetes status. 展开更多
关键词 Diabetes diagnosis statistical test Nonparametric statistics Normality test
下载PDF
Nonparametric Statistical Feature Scaling Based Quadratic Regressive Convolution Deep Neural Network for Software Fault Prediction
10
作者 Sureka Sivavelu Venkatesh Palanisamy 《Computers, Materials & Continua》 SCIE EI 2024年第3期3469-3487,共19页
The development of defect prediction plays a significant role in improving software quality. Such predictions are used to identify defective modules before the testing and to minimize the time and cost. The software w... The development of defect prediction plays a significant role in improving software quality. Such predictions are used to identify defective modules before the testing and to minimize the time and cost. The software with defects negatively impacts operational costs and finally affects customer satisfaction. Numerous approaches exist to predict software defects. However, the timely and accurate software bugs are the major challenging issues. To improve the timely and accurate software defect prediction, a novel technique called Nonparametric Statistical feature scaled QuAdratic regressive convolution Deep nEural Network (SQADEN) is introduced. The proposed SQADEN technique mainly includes two major processes namely metric or feature selection and classification. First, the SQADEN uses the nonparametric statistical Torgerson–Gower scaling technique for identifying the relevant software metrics by measuring the similarity using the dice coefficient. The feature selection process is used to minimize the time complexity of software fault prediction. With the selected metrics, software fault perdition with the help of the Quadratic Censored regressive convolution deep neural network-based classification. The deep learning classifier analyzes the training and testing samples using the contingency correlation coefficient. The softstep activation function is used to provide the final fault prediction results. To minimize the error, the Nelder–Mead method is applied to solve non-linear least-squares problems. Finally, accurate classification results with a minimum error are obtained at the output layer. Experimental evaluation is carried out with different quantitative metrics such as accuracy, precision, recall, F-measure, and time complexity. The analyzed results demonstrate the superior performance of our proposed SQADEN technique with maximum accuracy, sensitivity and specificity by 3%, 3%, 2% and 3% and minimum time and space by 13% and 15% when compared with the two state-of-the-art methods. 展开更多
关键词 Software defect prediction feature selection nonparametric statistical Torgerson-Gower scaling technique quadratic censored regressive convolution deep neural network softstep activation function nelder-mead method
下载PDF
Statistical Approach to Basketball Players’Skill Level
11
作者 Jiajun Wu 《Journal of Applied Mathematics and Physics》 2024年第4期1352-1363,共12页
In basketball, each player’s skill level is the key to a team’s success or failure, the skill level is affected by many personal and environmental factors. A physics-informed AI statistics has become extremely impor... In basketball, each player’s skill level is the key to a team’s success or failure, the skill level is affected by many personal and environmental factors. A physics-informed AI statistics has become extremely important. In this article, a complex non-linear process is considered by taking into account the average points per game of each player, playing time, shooting percentage, and others. This physics-informed statistics is to construct a multiple linear regression model with physics-informed neural networks. Based on the official data provided by the American Basketball League, and combined with specific methods of R program analysis, the regression model affecting the player’s average points per game is verified, and the key factors affecting the player’s average points per game are finally elucidated. The paper provides a novel window for coaches to make meaningful in-game adjustments to team members. 展开更多
关键词 Physics-Informed statistics Multiple Linear Regression Average Score per Game R Program Analysis
下载PDF
Statistical Inversion Based on Nonlinear Weighted Anisotropic Total Variational Model and Its Application in Electrical Impedance Tomography
12
作者 Pengfei Qi 《Engineering(科研)》 2024年第1期1-7,共7页
Electrical impedance tomography (EIT) aims to reconstruct the conductivity distribution using the boundary measured voltage potential. Traditional regularization based method would suffer from error propagation due to... Electrical impedance tomography (EIT) aims to reconstruct the conductivity distribution using the boundary measured voltage potential. Traditional regularization based method would suffer from error propagation due to the iteration process. The statistical inverse problem method uses statistical inference to estimate unknown parameters. In this article, we develop a nonlinear weighted anisotropic total variation (NWATV) prior density function based on the recently proposed NWATV regularization method. We calculate the corresponding posterior density function, i.e., the solution of the EIT inverse problem in the statistical sense, via a modified Markov chain Monte Carlo (MCMC) sampling. We do numerical experiment to validate the proposed approach. 展开更多
关键词 statistical Inverse Problem Electrical Impedance Tomography NWATV Prior Markov Chain Monte Carlo Sampling
下载PDF
Improving Statistical Literacy through Evidence-Based Strategies Among First-Year Education Students in a State University
13
作者 Israel M.Castillo 《Journal of Contemporary Educational Research》 2024年第1期246-259,共14页
Statistical literacy is crucial for cultivating well-rounded thinkers.The integration of evidence-based strategies in teaching and learning is pivotal for enhancing students’statistical literacy.This research specifi... Statistical literacy is crucial for cultivating well-rounded thinkers.The integration of evidence-based strategies in teaching and learning is pivotal for enhancing students’statistical literacy.This research specifically focuses on the utilization of Share and Model Concepts and Nurturing Metacognition as evidence-based strategies aimed at improving the statistical literacy of learners.The study employed a quasi-experimental design,specifically the nonequivalent control group,wherein students answered pre-test and post-test instruments and researcher-made questionnaires.The study included 50 first-year Bachelor in Secondary Education majors in Mathematics and Science for the academic year 2023-2024.The results of the study revealed a significant difference in the scores of student respondents,indicating that the use of evidence-based strategies helped students enhance their statistical literacy.This signifies a noteworthy increase in their performance,ranging from very low to very high proficiency in understanding statistical concepts,insights into the application of statistical concepts,numeracy,graph skills,interpretation capabilities,and visualization and communication skills.Furthermore,the study showed a significant difference in the post-test scores’performance of the two groups in understanding statistical concepts and visualization and communication skills.However,no significant difference was found in the post-test scores of the two groups concerning insights into the application of statistical concepts,numeracy and graph skills,and interpretation capabilities.Additionally,students acknowledged that the implementation of evidence-based strategies significantly contributed to the improvement of their statistical literacy. 展开更多
关键词 statistical literacy Evidence-based strategies Share and model concepts Nurturing metacognition Quasiexperimental
下载PDF
Visualising data distributions with kernel density estimation and reduced chi-squared statistic 被引量:7
14
作者 C.J.Spencer C.Yakymchuk M.Ghaznavi 《Geoscience Frontiers》 SCIE CAS CSCD 2017年第6期1247-1252,共6页
The application of frequency distribution statistics to data provides objective means to assess the nature of the data distribution and viability of numerical models that are used to visualize and interpret data.Two c... The application of frequency distribution statistics to data provides objective means to assess the nature of the data distribution and viability of numerical models that are used to visualize and interpret data.Two commonly used tools are the kernel density estimation and reduced chi-squared statistic used in combination with a weighted mean.Due to the wide applicability of these tools,we present a Java-based computer application called KDX to facilitate the visualization of data and the utilization of these numerical tools. 展开更多
关键词 Data visualisation KERNEL DENSITY estimation REDUCED chi-squared statistic Mean SQUARE WEIGHTED deviation GEOstatisticS
下载PDF
Generalized Kumaraswamy Generalized Power Gompertz Distribution: Statistical Properties, Application, and Validation Using a Modified Chi-Squared Goodness of Fit Test
15
作者 Obubu Maxwell Ibeakuzie Precious Onyedikachi +2 位作者 Khaoula Aidi Chijioke Igwe Akpa Nacira Seddik-Ameur 《Applied Mathematics》 2022年第3期243-262,共20页
A new six-parameter continuous distribution called the Generalized Kumaraswamy Generalized Power Gompertz (GKGPG) distribution is proposed in this study, a graphical illustration of the probability density function an... A new six-parameter continuous distribution called the Generalized Kumaraswamy Generalized Power Gompertz (GKGPG) distribution is proposed in this study, a graphical illustration of the probability density function and cumulative distribution function is presented. The statistical features of the Generalized Kumaraswamy Generalized Power Gompertz distribution are systematically derived and adequately studied. The estimation of the model parameters in the absence of censoring and under-right censoring is performed using the method of maximum likelihood. The test statistic for right-censored data, criteria test for GKGPG distribution, estimated matrix &#372;, &#264;, and &#284;, criteria test Y<sup>2</sup>n</sub>, alongside the quadratic form of the test statistic is derived. Mean simulated values of maximum likelihood estimates and their corresponding square mean errors are presented and confirmed to agree closely with the true parameter values. Simulated levels of significance for Y<sup>2</sup>n</sub> (γ) test for the GKGPG model against their theoretical values were recorded. We conclude that the null hypothesis for which simulated samples are fitted by GKGPG distribution is widely validated for the different levels of significance considered. From the summary of the results of the strength of a specific type of braided cord dataset on the GKGPG model, it is observed that the proposed GKGPG model fits the data set for a significance level ε = 0.05. 展开更多
关键词 Power Gompertz Generalized Kumaraswamy-G Modified chi-squared the Goodness of Fit CENSORING
下载PDF
留学生Medical Statistics线上课程建设与远程教学的实践与思考
16
作者 丁竞竞 钱炜春 +1 位作者 赵杨 张汝阳 《中国卫生统计》 CSCD 北大核心 2023年第6期942-945,949,共5页
受政治、经济和全球健康等因素影响,远程教育成为高等教育的一个发展趋势。新冠疫情防控期间,受出入境限制,未能返华的留学生一直以远程教学推进学业,成为其间持续进行远程教学最久的群体。本研究总结临床专业本科留学生主干课程Medical... 受政治、经济和全球健康等因素影响,远程教育成为高等教育的一个发展趋势。新冠疫情防控期间,受出入境限制,未能返华的留学生一直以远程教学推进学业,成为其间持续进行远程教学最久的群体。本研究总结临床专业本科留学生主干课程Medical Statistics的校级一流线上课程建设与教学实践,并比较了疫情前后,线下教学与远程教学的留学生期末考试成绩,发现远程教学成绩经历波动后逐渐稳定并接近传统线下教学。本文同时对效果影响因素进行了调研和分析,发现“充分学习和使用远程课程中丰富的资源”和“上网是否容易”是留学生学习效果的主要影响因素。提出建设丰富教学资源对促进医学统计学远程学习效果的重要影响,同时提出对策建议,以期进一步提高远程教学水平,服务高校现代化课程体系建设。 展开更多
关键词 线上课程建设 远程教学实践 Medical statistics 医学统计学 本科留学生
下载PDF
The impact of genotyping strategies and statistical models on accuracy of genomic prediction for survival in pigs 被引量:1
17
作者 Tianfei Liu Bjarne Nielsen +2 位作者 Ole F.Christensen Mogens SandøLund Guosheng Su 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2023年第3期908-916,共9页
Background:Survival from birth to slaughter is an important economic trait in commercial pig productions.Increasing survival can improve both economic efficiency and animal welfare.The aim of this study is to explore ... Background:Survival from birth to slaughter is an important economic trait in commercial pig productions.Increasing survival can improve both economic efficiency and animal welfare.The aim of this study is to explore the impact of genotyping strategies and statistical models on the accuracy of genomic prediction for survival in pigs during the total growing period from birth to slaughter.Results:We simulated pig populations with different direct and maternal heritabilities and used a linear mixed model,a logit model,and a probit model to predict genomic breeding values of pig survival based on data of individual survival records with binary outcomes(0,1).The results show that in the case of only alive animals having genotype data,unbiased genomic predictions can be achieved when using variances estimated from pedigreebased model.Models using genomic information achieved up to 59.2%higher accuracy of estimated breeding value compared to pedigree-based model,dependent on genotyping scenarios.The scenario of genotyping all individuals,both dead and alive individuals,obtained the highest accuracy.When an equal number of individuals(80%)were genotyped,random sample of individuals with genotypes achieved higher accuracy than only alive individuals with genotypes.The linear model,logit model and probit model achieved similar accuracy.Conclusions:Our conclusion is that genomic prediction of pig survival is feasible in the situation that only alive pigs have genotypes,but genomic information of dead individuals can increase accuracy of genomic prediction by 2.06%to 6.04%. 展开更多
关键词 Genomic prediction Genotyping strategy Simulation statistical models SURVIVAL
下载PDF
Chi-Square and PCA Based Feature Selection for Diabetes Detection with Ensemble Classifier
18
作者 Vaibhav Rupapara Furqan Rustam +2 位作者 Abid Ishaq Ernesto Lee Imran Ashraf 《Intelligent Automation & Soft Computing》 SCIE 2023年第5期1931-1949,共19页
Diabetes mellitus is a metabolic disease that is ranked among the top 10 causes of death by the world health organization.During the last few years,an alarming increase is observed worldwide with a 70%rise in the dise... Diabetes mellitus is a metabolic disease that is ranked among the top 10 causes of death by the world health organization.During the last few years,an alarming increase is observed worldwide with a 70%rise in the disease since 2000 and an 80%rise in male deaths.If untreated,it results in complications of many vital organs of the human body which may lead to fatality.Early detection of diabetes is a task of significant importance to start timely treatment.This study introduces a methodology for the classification of diabetic and normal people using an ensemble machine learning model and feature fusion of Chi-square and principal component analysis.An ensemble model,logistic tree classifier(LTC),is proposed which incorporates logistic regression and extra tree classifier through a soft voting mechanism.Experiments are also performed using several well-known machine learning algorithms to analyze their performance including logistic regression,extra tree classifier,AdaBoost,Gaussian naive Bayes,decision tree,random forest,and k nearest neighbor.In addition,several experiments are carried out using principal component analysis(PCA)and Chi-square(Chi-2)fea-tures to analyze the influence of feature selection on the performance of machine learning classifiers.Results indicate that Chi-2 features show high performance than both PCA features and original features.However,the highest accuracy is obtained when the proposed ensemble model LTC is used with the proposed fea-ture fusion framework-work which achieves a 0.85 accuracy score which is the highest of the available approaches for diabetes prediction.In addition,the statis-tical T-test proves the statistical significance of the proposed approach over other approaches. 展开更多
关键词 Diabetes mellitus prediction feature fusion ensemble classifier principal component analysis chi-square
下载PDF
Statistical Properties of Alfvén Ion Cyclotron Waves and Kinetic Alfvén Waves in the Inner Heliosphere
19
作者 Chang Sun Lei Yang +4 位作者 Qiu-Huan Li Cun-Li Dai Jian-Ping Li Zheng-Wei Cheng De-Jin Wu 《Research in Astronomy and Astrophysics》 SCIE CAS CSCD 2023年第9期341-350,共10页
Alfvén ion cyclotron waves(ACWs)and kinetic Alfvén waves(KAWs)are found to exist at<0.3 au observed by Parker Solar Probe in Alfvénic slow solar winds.To examine the statistical properties of the bac... Alfvén ion cyclotron waves(ACWs)and kinetic Alfvén waves(KAWs)are found to exist at<0.3 au observed by Parker Solar Probe in Alfvénic slow solar winds.To examine the statistical properties of the background parameters for ACWs and KAWs and related wave disturbances,both wave events observed by Parker Solar Probe are selected and analyzed.The results show that there are obvious differences in the background and disturbance parameters between ACWs and KAWs.ACW events have a relatively higher occurrence rate but with a total duration slightly shorter than KAW events.The median background magnetic field magnitude and the related background solar wind speed of KAW events are larger than those of ACWs.The distributions of the relative disturbances of the proton velocity,proton temperature,the proton number density,andβcover wider ranges for ACW events than for KAW events.The results may be important for the understanding of the nature and characteristics of Alfvénic slow solar wind fluctuations at ion scales near the Sun,and provide the information of the background field and plasma parameters and the wave disturbances of ACWs and KAWs for further relevant theoretical modeling or numerical simulations. 展开更多
关键词 (Sun )solar wind-plasmas-waves-methods statisticAL
下载PDF
A Comprehensive Guide for Selecting Appropriate Statistical Tests: Understanding When to Use Parametric and Nonparametric Tests
20
作者 Saed Jama Abdi 《Open Journal of Statistics》 2023年第4期464-474,共11页
Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Kn... Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Knowing how to select an appropriate test can lead to more accurate results. Invalid results and misleading conclusions may be drawn from a study if an incorrect statistical test is used. Therefore, to avoid these it is essential to understand the nature of the data, the research question, and the assumptions of the tests before selecting one. This is because there are a wide variety of tests available. This paper provides a step-by-step approach to selecting the right statistical test for any study, with an explanation of when it is appropriate to use it and relevant examples of each statistical test. Furthermore, this guide provides a comprehensive overview of the assumptions of each test and what to do if these assumptions are violated. 展开更多
关键词 statistical Tests Levels of Measurement PARAMETRIC NONPARAMETRIC Normal Distribution
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部