In this work, four empirical models of statistical thickness, namely the models of Harkins and Jura, Hasley, Carbon Black and Jaroniec, were compared in order to determine the textural properties (external surface and...In this work, four empirical models of statistical thickness, namely the models of Harkins and Jura, Hasley, Carbon Black and Jaroniec, were compared in order to determine the textural properties (external surface and surface of micropores) of a clay concrete without molasses and clay concretes stabilized with 8%, 12% and 16% molasses. The results obtained show that Hasley’s model can be used to obtain the external surfaces. However, it does not allow the surface of the micropores to be obtained, and is not suitable for the case of simple clay concrete (without molasses) and for clay concretes stabilized with molasses. The Carbon Black, Jaroniec and Harkins and Jura models can be used for clay concrete and stabilized clay concrete. However, the Carbon Black model is the most relevant for clay concrete and the Harkins and Jura model is for molasses-stabilized clay concrete. These last two models augur well for future research.展开更多
The present study aims to establish a relationship between serum AMH levels and age in a large group of women living in Bulgaria, as well as to establish reference age-specific AMH levels in women that would serve as ...The present study aims to establish a relationship between serum AMH levels and age in a large group of women living in Bulgaria, as well as to establish reference age-specific AMH levels in women that would serve as an initial estimate of ovarian age. A total of 28,016 women on the territory of the Republic of Bulgaria were tested for serum AMH levels with a median age of 37.0 years (interquartile range 32.0 to 41.0). For women aged 20 - 29 years, the Bulgarian population has relatively high median levels of AMH, similar to women of Asian origin. For women aged 30 - 34 years, our results are comparable to those of women living in Western Europe. For women aged 35 - 39 years, our results are comparable to those of women living in the territory of India and Kenya. For women aged 40 - 44 years, our results were lower than those for women from the Western European and Chinese populations, close to the Indian and higher than Korean and Kenya populations, respectively. Our results for women of Bulgarian origin are also comparable to US Latina women at age 30, 35 and 40 ages. On the base on constructed a statistical model to predicting the decline in AMH levels at different ages, we found non-linear structure of AMH decline for the low AMH 3.5) the dependence of the decline of AMH on age was confirmed as linear. In conclusion, we evaluated the serum level of AMH in Bulgarian women and established age-specific AMH percentile reference values based on a large representative sample. We have developed a prognostic statistical model that can facilitate the application of AMH in clinical practice and the prediction of reproductive capacity and population health.展开更多
Background:Survival from birth to slaughter is an important economic trait in commercial pig productions.Increasing survival can improve both economic efficiency and animal welfare.The aim of this study is to explore ...Background:Survival from birth to slaughter is an important economic trait in commercial pig productions.Increasing survival can improve both economic efficiency and animal welfare.The aim of this study is to explore the impact of genotyping strategies and statistical models on the accuracy of genomic prediction for survival in pigs during the total growing period from birth to slaughter.Results:We simulated pig populations with different direct and maternal heritabilities and used a linear mixed model,a logit model,and a probit model to predict genomic breeding values of pig survival based on data of individual survival records with binary outcomes(0,1).The results show that in the case of only alive animals having genotype data,unbiased genomic predictions can be achieved when using variances estimated from pedigreebased model.Models using genomic information achieved up to 59.2%higher accuracy of estimated breeding value compared to pedigree-based model,dependent on genotyping scenarios.The scenario of genotyping all individuals,both dead and alive individuals,obtained the highest accuracy.When an equal number of individuals(80%)were genotyped,random sample of individuals with genotypes achieved higher accuracy than only alive individuals with genotypes.The linear model,logit model and probit model achieved similar accuracy.Conclusions:Our conclusion is that genomic prediction of pig survival is feasible in the situation that only alive pigs have genotypes,but genomic information of dead individuals can increase accuracy of genomic prediction by 2.06%to 6.04%.展开更多
Soil erosion is a crucial geo-environmental hazard worldwide that affects water quality and agriculture,decreases reservoir storage capacity due to sedimentation,and increases the danger of flooding and landslides.Thu...Soil erosion is a crucial geo-environmental hazard worldwide that affects water quality and agriculture,decreases reservoir storage capacity due to sedimentation,and increases the danger of flooding and landslides.Thus,this study uses geospatial modeling to produce soil erosion susceptibility maps(SESM)for the Hangu region,Khyber Pakhtunkhwa(KPK),Pakistan.The Hangu region,located in the Kohat Plateau of KPK,Pakistan,is particularly susceptible to soil erosion due to its unique geomorphological and climatic characteristics.Moreover,the Hangu region is characterized by a combination of steep slopes,variable rainfall patterns,diverse land use,and distinct soil types,all of which contribute to the complexity and severity of soil erosion processes.These factors necessitate a detailed and region-specific study to develop effective soil conservation strategies.In this research,we detected and mapped 1013 soil erosion points and prepared 12 predisposing factors(elevation,aspect,slope,Normalized Differentiate Vegetation Index(NDVI),drainage network,curvature,Land Use Land Cover(LULC),rainfall,lithology,contour,soil texture,and road network)of soil erosion using GIS platform.Additionally,GIS-based statistical models like the weight of evidence(WOE)and frequency ratio(FR)were applied to produce the SESM for the study area.The SESM was reclassified into four classes,i.e.,low,medium,high,and very high zone.The results of WOE for SESM show that 16.39%,33.02%,29.27%,and 21.30%of areas are covered by low,medium,high,and very high zones,respectively.In contrast,the FR results revealed that 16.50%,24.33%,35.55%,and 23.59%of the areas are occupied by low,medium,high,and very high classes.Furthermore,the reliability of applied models was evaluated using the Area Under Curve(AUC)technique.The validation results utilizing the area under curve showed that the success rate curve(SRC)and predicted rate curve(PRC)for WOE are 82%and 86%,respectively,while SRC and PRC for FR are 85%and 96%,respectively.The validation results revealed that the FR model performance is better and more reliable than the WOE.展开更多
Sunshine duration (S) based empirical equations have been employed in this study to estimate the daily global solar radiation on a horizontal surface (G) for six meteorological stations in Burundi. Those equations inc...Sunshine duration (S) based empirical equations have been employed in this study to estimate the daily global solar radiation on a horizontal surface (G) for six meteorological stations in Burundi. Those equations include the Ångström-Prescott linear model and four amongst its derivatives, i.e. logarithmic, exponential, power and quadratic functions. Monthly mean values of daily global solar radiation and sunshine duration data for a period of 20 to 23 years, from the Geographical Institute of Burundi (IGEBU), have been used. For any of the six stations, ten single or double linear regressions have been developed from the above-said five functions, to relate in terms of monthly mean values, the daily clearness index () to each of the next two kinds of relative sunshine duration (RSD): and . In those ratios, G<sub>0</sub>, S<sub>0 </sub>and stand for the extraterrestrial daily solar radiation on a horizontal surface, the day length and the modified day length taking into account the natural site’s horizon, respectively. According to the calculated mean values of the clearness index and the RSD, each station experiences a high number of fairly clear (or partially cloudy) days. Estimated values of the dependent variable (y) in each developed linear regression, have been compared to measured values in terms of the coefficients of correlation (R) and of determination (R<sub>2</sub>), the mean bias error (MBE), the root mean square error (RMSE) and the t-statistics. Mean values of these statistical indicators have been used to rank, according to decreasing performance level, firstly the ten developed equations per station on account of the overall six stations, secondly the six stations on account of the overall ten equations. Nevertheless, the obtained values of those indicators lay in the next ranges for all the developed sixty equations:;;;, with . These results lead to assert that any of the sixty developed linear regressions (and thus equations in terms of and ), fits very adequately measured data, and should be used to estimate monthly average daily global solar radiation with sunshine duration for the relevant station. It is also found that using as RSD, is slightly more advantageous than using for estimating the monthly average daily clearness index, . Moreover, values of statistical indicators of this study match adequately data from other works on the same kinds of empirical equations.展开更多
Statistical literacy is crucial for cultivating well-rounded thinkers.The integration of evidence-based strategies in teaching and learning is pivotal for enhancing students’statistical literacy.This research specifi...Statistical literacy is crucial for cultivating well-rounded thinkers.The integration of evidence-based strategies in teaching and learning is pivotal for enhancing students’statistical literacy.This research specifically focuses on the utilization of Share and Model Concepts and Nurturing Metacognition as evidence-based strategies aimed at improving the statistical literacy of learners.The study employed a quasi-experimental design,specifically the nonequivalent control group,wherein students answered pre-test and post-test instruments and researcher-made questionnaires.The study included 50 first-year Bachelor in Secondary Education majors in Mathematics and Science for the academic year 2023-2024.The results of the study revealed a significant difference in the scores of student respondents,indicating that the use of evidence-based strategies helped students enhance their statistical literacy.This signifies a noteworthy increase in their performance,ranging from very low to very high proficiency in understanding statistical concepts,insights into the application of statistical concepts,numeracy,graph skills,interpretation capabilities,and visualization and communication skills.Furthermore,the study showed a significant difference in the post-test scores’performance of the two groups in understanding statistical concepts and visualization and communication skills.However,no significant difference was found in the post-test scores of the two groups concerning insights into the application of statistical concepts,numeracy and graph skills,and interpretation capabilities.Additionally,students acknowledged that the implementation of evidence-based strategies significantly contributed to the improvement of their statistical literacy.展开更多
Recently,gender equality and women’s entrepreneurship have gained considerable attention in global economic development.Prior to the design of any policy interventions to increase women’s entrepreneurship,it is sign...Recently,gender equality and women’s entrepreneurship have gained considerable attention in global economic development.Prior to the design of any policy interventions to increase women’s entrepreneurship,it is significant to comprehend the factors motivating women to become entrepreneurs.The non-understanding of the factors can result in the endurance of low living stan-dards and the design of expensive and ineffectual policies.But female involve-ment in entrepreneurship becomes higher in developing economies compared to developed economies.Women Entrepreneurship Index(WEI)plays a vital role in determining the factors that enable theflourishment of high potential female entrepreneurs which enhances economic welfare and contributes to the economic and social fabric of society.Therefore,it is needed to design an automated and accurate WEI prediction model to improve women’s entrepreneurship.In this view,this article develops an automated statistical analysis enabled WEI predic-tive(ASA-WEIP)model.The proposed ASA-WEIP technique aims to effectually determine the WEI.The proposed ASA-WEIP technique encompasses a series of sub-processes such as pre-processing,WEI prediction,and parameter optimiza-tion.For the prediction of WEI,the ASA-WEIP technique makes use of the Deep Belief Network(DBN)model,and the parameter optimization process takes place using Squirrel Search Algorithm(SSA).The performance validation of the ASA-WEIP technique was executed using the benchmark dataset from the Kaggle repo-sitory.The experimental outcomes stated the better outcomes of the ASA-WEIP technique over the other existing techniques.展开更多
The railway mobile communication system is undergoing a smooth transition from the Global System for Mobile Communications-Railway(GSM-R)to the Railway 5G.In this paper,an empirical path loss model based on a large am...The railway mobile communication system is undergoing a smooth transition from the Global System for Mobile Communications-Railway(GSM-R)to the Railway 5G.In this paper,an empirical path loss model based on a large amount of measured data is established to predict the path loss in the Railway 5G marshalling yard scenario.According to the different characteristics of base station directional antennas,the antenna gain is verified.Then we propose the position of the breakpoint in the antenna propagation area,and based on the breakpoint segmentation,a large-scale statistical model for marshalling yards is established.展开更多
Landslide susceptibility mapping is vital for landslide risk management and urban planning.In this study,we used three statistical models[frequency ratio,certainty factor and index of entropy(IOE)]and a machine learni...Landslide susceptibility mapping is vital for landslide risk management and urban planning.In this study,we used three statistical models[frequency ratio,certainty factor and index of entropy(IOE)]and a machine learning model[random forest(RF)]for landslide susceptibility mapping in Wanzhou County,China.First,a landslide inventory map was prepared using earlier geotechnical investigation reports,aerial images,and field surveys.Then,the redundant factors were excluded from the initial fourteen landslide causal factors via factor correlation analysis.To determine the most effective causal factors,landslide susceptibility evaluations were performed based on four cases with different combinations of factors("cases").In the analysis,465(70%)landslide locations were randomly selected for model training,and 200(30%)landslide locations were selected for verification.The results showed that case 3 produced the best performance for the statistical models and that case 2 produced the best performance for the RF model.Finally,the receiver operating characteristic(ROC)curve was used to verify the accuracy of each model's results for its respective optimal case.The ROC curve analysis showed that the machine learning model performed better than the other three models,and among the three statistical models,the IOE model with weight coefficients was superior.展开更多
Several statistical methods have been developed for analyzing genotype×environment(GE)interactions in crop breeding programs to identify genotypes with high yield and stability performances.Four statistical metho...Several statistical methods have been developed for analyzing genotype×environment(GE)interactions in crop breeding programs to identify genotypes with high yield and stability performances.Four statistical methods,including joint regression analysis(JRA),additive mean effects and multiplicative interaction(AMMI)analysis,genotype plus GE interaction(GGE)biplot analysis,and yield–stability(YSi)statistic were used to evaluate GE interaction in20 winter wheat genotypes grown in 24 environments in Iran.The main objective was to evaluate the rank correlations among the four statistical methods in genotype rankings for yield,stability and yield–stability.Three kinds of genotypic ranks(yield ranks,stability ranks,and yield–stability ranks)were determined with each method.The results indicated the presence of GE interaction,suggesting the need for stability analysis.With respect to yield,the genotype rankings by the GGE biplot and AMMI analysis were significantly correlated(P<0.01).For stability ranking,the rank correlations ranged from 0.53(GGE–YSi;P<0.05)to0.97(JRA–YSi;P<0.01).AMMI distance(AMMID)was highly correlated(P<0.01)with variance of regression deviation(S2di)in JRA(r=0.83)and Shukla stability variance(σ2)in YSi(r=0.86),indicating that these stability indices can be used interchangeably.No correlation was found between yield ranks and stability ranks(AMMID,S2di,σ2,and GGE stability index),indicating that they measure static stability and accordingly could be used if selection is based primarily on stability.For yield–stability,rank correlation coefficients among the statistical methods varied from 0.64(JRA–YSi;P<0.01)to 0.89(AMMI–YSi;P<0.01),indicating that AMMI and YSi were closely associated in the genotype ranking for integrating yield with stability performance.Based on the results,it can be concluded that YSi was closely correlated with(i)JRA in ranking genotypes for stability and(ii)AMMI for integrating yield and stability.展开更多
The performance of six statistical approaches,which can be used for selection of the best model to describe the growth of individual fish,was analyzed using simulated and real length-at-age data.The six approaches inc...The performance of six statistical approaches,which can be used for selection of the best model to describe the growth of individual fish,was analyzed using simulated and real length-at-age data.The six approaches include coefficient of determination(R2),adjusted coefficient of determination(adj.-R2),root mean squared error(RMSE),Akaike's information criterion(AIC),bias correction of AIC(AICc) and Bayesian information criterion(BIC).The simulation data were generated by five growth models with different numbers of parameters.Four sets of real data were taken from the literature.The parameters in each of the five growth models were estimated using the maximum likelihood method under the assumption of the additive error structure for the data.The best supported model by the data was identified using each of the six approaches.The results show that R2 and RMSE have the same properties and perform worst.The sample size has an effect on the performance of adj.-R2,AIC,AICc and BIC.Adj.-R2 does better in small samples than in large samples.AIC is not suitable to use in small samples and tends to select more complex model when the sample size becomes large.AICc and BIC have best performance in small and large sample cases,respectively.Use of AICc or BIC is recommended for selection of fish growth model according to the size of the length-at-age data.展开更多
Based on the review and comparison of main statistical analysis models for estimating variety-environment cell means in regional crop trials, a new statistical model, LR-PCA composite model was proposed, and the predi...Based on the review and comparison of main statistical analysis models for estimating variety-environment cell means in regional crop trials, a new statistical model, LR-PCA composite model was proposed, and the predictive precision of these models were compared by cross validation of an example data. Results showed that the order of model precision was LR-PCA model > AMMI model > PCA model > Treatment Means (TM) model > Linear Regression (LR) model > Additive Main Effects ANOVA model. The precision gain factor of LR-PCA model was 1.55, increasing by 8.4% compared with AMMI.展开更多
This work correlated the detailed work zone location and time data from the Wis LCS system with the five-min inductive loop detector data. One-sample percentile value test and two-sample Kolmogorov-Smirnov(K-S) test w...This work correlated the detailed work zone location and time data from the Wis LCS system with the five-min inductive loop detector data. One-sample percentile value test and two-sample Kolmogorov-Smirnov(K-S) test were applied to compare the speed and flow characteristics between work zone and non-work zone conditions. Furthermore, we analyzed the mobility characteristics of freeway work zones within the urban area of Milwaukee, WI, USA. More than 50% of investigated work zones have experienced speed reduction and 15%-30% is necessary reduced volumes. Speed reduction was more significant within and at the downstream of work zones than at the upstream.展开更多
QTL mapping for seven quality traits was conducted by using 254 recombinant inbred lines (RIL) derived from a japonica-japonica rice cross of Xiushui 79/C Bao. The seven traits investigated were grain length (GL),...QTL mapping for seven quality traits was conducted by using 254 recombinant inbred lines (RIL) derived from a japonica-japonica rice cross of Xiushui 79/C Bao. The seven traits investigated were grain length (GL), grain length to width ratio (LWR), chalk grain rate (CGR), chalkiness degree (CD), gelatinization temperature (GT), amylose content (AC) and gel consistency (GC) of head rice. Three mapping methods employed were composite interval mapping in QTLMapper 2.0 software based on mixed linear model (MCIM), inclusive composite interval mapping in QTL IciMapping 3.0 software based on stepwise regression linear model (ICIM) and multiple interval mapping with regression forward selection in Windows QTL Cartographer 2.5 based on multiple regression analysis (MIMR). Results showed that five QTLs with additive effect (A-QTLs) were detected by all the three methods simultaneously, two by two methods simultaneously, and 23 by only one method. Five A-QTLs were detected by MCIM, nine by ICIM and 28 by MIMR. The contribution rates of single A-QTL ranged from 0.89% to 38.07%. All the QTLs with epistatic effect (E-QTLs) detected by MIMR were not detected by the other two methods. Fourteen pairs of E-QTLs were detected by both MCIM and ICIM, and 142 pairs of E-QTLs were detected by only one method. Twenty-five pairs of E-QTLs were detected by MCIM, 141 pairs by ICIM and four pairs by MIMR. The contribution rates of single pair of E-QTL were from 2.60% to 23.78%. In the Xiu-Bao RIL population, epistatic effect played a major role in the variation of GL and CD, and additive effect was the dominant in the variation of LWR, while both epistatic effect and additive effect had equal importance in the variation of CGR, AC, GT and GC. QTLs detected by two or more methods simultaneously were highly reliable, and could be applied to improve the quality traits in japonica hybrid rice.展开更多
The water resources of the Nadhour-Sisseb-El Alem Basin in Tunisia exhibit semi-arid and arid climatic conditions.This induces an excessive pumping of groundwater,which creates drops in water level ranging about 1-2 m...The water resources of the Nadhour-Sisseb-El Alem Basin in Tunisia exhibit semi-arid and arid climatic conditions.This induces an excessive pumping of groundwater,which creates drops in water level ranging about 1-2 m/a.Indeed,these unfavorable conditions require interventions to rationalize integrated management in decision making.The aim of this study is to determine a water recharge index(WRI),delineate the potential groundwater recharge area and estimate the potential groundwater recharge rate based on the integration of statistical models resulted from remote sensing imagery,GIS digital data(e.g.,lithology,soil,runoff),measured artificial recharge data,fuzzy set theory and multi-criteria decision making(MCDM)using the analytical hierarchy process(AHP).Eight factors affecting potential groundwater recharge were determined,namely lithology,soil,slope,topography,land cover/use,runoff,drainage and lineaments.The WRI is between 1.2 and 3.1,which is classified into five classes as poor,weak,moderate,good and very good sites of potential groundwater recharge area.The very good and good classes occupied respectively 27%and 44%of the study area.The potential groundwater recharge rate was 43%of total precipitation.According to the results of the study,river beds are favorable sites for groundwater recharge.展开更多
Forecasting the movement of stock market is a long-time attractive topic. This paper implements different statistical learning models to predict the movement of S&P 500 index. The S&P 500 index is influenced b...Forecasting the movement of stock market is a long-time attractive topic. This paper implements different statistical learning models to predict the movement of S&P 500 index. The S&P 500 index is influenced by other important financial indexes across the world such as commodity price and financial technical indicators. This paper systematically investigated four supervised learning models, including Logistic Regression, Gaussian Discriminant Analysis (GDA), Naive Bayes and Support Vector Machine (SVM) in the forecast of S&P 500 index. After several experiments of optimization in features and models, especially the SVM kernel selection and feature selection for different models, this paper concludes that a SVM model with a Radial Basis Function (RBF) kernel can achieve an accuracy rate of 62.51% for the future market trend of the S&P 500 index.展开更多
Following the basic principle of modem multivariate statistical analysis theory, the description model, prediction model and control model to relate chemical compositions and mechanical properties of steels are introd...Following the basic principle of modem multivariate statistical analysis theory, the description model, prediction model and control model to relate chemical compositions and mechanical properties of steels are introduced. As an example, the total flowchart of components and structure/properties description, prediction and control model for chemical composition and mechanical properties of 20 and A_2 steel are presented.展开更多
Semi-rigid liquid crystal polymer is a class of liquid crystal polymers different from long rigid rod liquid crystal polymer to which the well-known Onsager and Flory theories are applied. In this paper, three statist...Semi-rigid liquid crystal polymer is a class of liquid crystal polymers different from long rigid rod liquid crystal polymer to which the well-known Onsager and Flory theories are applied. In this paper, three statistical models for the semi-rigid nematic polymer were addressed. They are the elastically jointed rod model, worm-like chain model, and non-homogeneous chain model. The nematic-isotropic transition temperature was examined. The pseudo-second transition temperature is expressed analytically. Comparisons with the experiments were made and the agreements were found.展开更多
This paper describes the stochastic model of the scattered electromagnetic field.Unlike common_used functional_determined models the proposed is characterised by amplitude/phase fluctuation of the received signal.This...This paper describes the stochastic model of the scattered electromagnetic field.Unlike common_used functional_determined models the proposed is characterised by amplitude/phase fluctuation of the received signal.This paper derives the statistical characteristic of the input signal and describes algorithm for its estimation in post_processing and real_time processing modes.Achieved characteristics allow the mapping and estimation of the surface models more accurate,moreover,such processing increase space resolution of synthetic aperture radar.展开更多
The objective of this study is to analyze the sensitivity of the statistical models regarding the size of samples. The study carried out in Ivory Coast is based on annual maximum daily rainfall data collected from 26 ...The objective of this study is to analyze the sensitivity of the statistical models regarding the size of samples. The study carried out in Ivory Coast is based on annual maximum daily rainfall data collected from 26 stations. The methodological approach is based on the statistical modeling of maximum daily rainfall. Adjustments were made on several sample sizes and several return periods (2, 5, 10, 20, 50 and 100 years). The main results have shown that the 30 years series (1931-1960;1961-1990;1991-2020) are better adjusted by the Gumbel (26.92% - 53.85%) and Inverse Gamma (26.92% - 46.15%). Concerning the 60-years series (1931-1990;1961-2020), they are better adjusted by the Inverse Gamma (30.77%), Gamma (15.38% - 46.15%) and Gumbel (15.38% - 42.31%). The full chronicle 1931-2020 (90 years) presents a notable supremacy of 50% of Gumbel model over the Gamma (34.62%) and Gamma Inverse (15.38%) model. It is noted that the Gumbel is the most dominant model overall and more particularly in wet periods. The data for periods with normal and dry trends were better fitted by Gamma and Inverse Gamma.展开更多
文摘In this work, four empirical models of statistical thickness, namely the models of Harkins and Jura, Hasley, Carbon Black and Jaroniec, were compared in order to determine the textural properties (external surface and surface of micropores) of a clay concrete without molasses and clay concretes stabilized with 8%, 12% and 16% molasses. The results obtained show that Hasley’s model can be used to obtain the external surfaces. However, it does not allow the surface of the micropores to be obtained, and is not suitable for the case of simple clay concrete (without molasses) and for clay concretes stabilized with molasses. The Carbon Black, Jaroniec and Harkins and Jura models can be used for clay concrete and stabilized clay concrete. However, the Carbon Black model is the most relevant for clay concrete and the Harkins and Jura model is for molasses-stabilized clay concrete. These last two models augur well for future research.
文摘The present study aims to establish a relationship between serum AMH levels and age in a large group of women living in Bulgaria, as well as to establish reference age-specific AMH levels in women that would serve as an initial estimate of ovarian age. A total of 28,016 women on the territory of the Republic of Bulgaria were tested for serum AMH levels with a median age of 37.0 years (interquartile range 32.0 to 41.0). For women aged 20 - 29 years, the Bulgarian population has relatively high median levels of AMH, similar to women of Asian origin. For women aged 30 - 34 years, our results are comparable to those of women living in Western Europe. For women aged 35 - 39 years, our results are comparable to those of women living in the territory of India and Kenya. For women aged 40 - 44 years, our results were lower than those for women from the Western European and Chinese populations, close to the Indian and higher than Korean and Kenya populations, respectively. Our results for women of Bulgarian origin are also comparable to US Latina women at age 30, 35 and 40 ages. On the base on constructed a statistical model to predicting the decline in AMH levels at different ages, we found non-linear structure of AMH decline for the low AMH 3.5) the dependence of the decline of AMH on age was confirmed as linear. In conclusion, we evaluated the serum level of AMH in Bulgarian women and established age-specific AMH percentile reference values based on a large representative sample. We have developed a prognostic statistical model that can facilitate the application of AMH in clinical practice and the prediction of reproductive capacity and population health.
基金funded by the"Genetic improvement of pig survival"project from Danish Pig Levy Foundation (Aarhus,Denmark)The China Scholarship Council (CSC)for providing scholarship to the first author。
文摘Background:Survival from birth to slaughter is an important economic trait in commercial pig productions.Increasing survival can improve both economic efficiency and animal welfare.The aim of this study is to explore the impact of genotyping strategies and statistical models on the accuracy of genomic prediction for survival in pigs during the total growing period from birth to slaughter.Results:We simulated pig populations with different direct and maternal heritabilities and used a linear mixed model,a logit model,and a probit model to predict genomic breeding values of pig survival based on data of individual survival records with binary outcomes(0,1).The results show that in the case of only alive animals having genotype data,unbiased genomic predictions can be achieved when using variances estimated from pedigreebased model.Models using genomic information achieved up to 59.2%higher accuracy of estimated breeding value compared to pedigree-based model,dependent on genotyping scenarios.The scenario of genotyping all individuals,both dead and alive individuals,obtained the highest accuracy.When an equal number of individuals(80%)were genotyped,random sample of individuals with genotypes achieved higher accuracy than only alive individuals with genotypes.The linear model,logit model and probit model achieved similar accuracy.Conclusions:Our conclusion is that genomic prediction of pig survival is feasible in the situation that only alive pigs have genotypes,but genomic information of dead individuals can increase accuracy of genomic prediction by 2.06%to 6.04%.
基金The authors extend their appreciation to Researchers Supporting Project number(RSP2024R390),King Saud University,Riyadh,Saudi Arabia.
文摘Soil erosion is a crucial geo-environmental hazard worldwide that affects water quality and agriculture,decreases reservoir storage capacity due to sedimentation,and increases the danger of flooding and landslides.Thus,this study uses geospatial modeling to produce soil erosion susceptibility maps(SESM)for the Hangu region,Khyber Pakhtunkhwa(KPK),Pakistan.The Hangu region,located in the Kohat Plateau of KPK,Pakistan,is particularly susceptible to soil erosion due to its unique geomorphological and climatic characteristics.Moreover,the Hangu region is characterized by a combination of steep slopes,variable rainfall patterns,diverse land use,and distinct soil types,all of which contribute to the complexity and severity of soil erosion processes.These factors necessitate a detailed and region-specific study to develop effective soil conservation strategies.In this research,we detected and mapped 1013 soil erosion points and prepared 12 predisposing factors(elevation,aspect,slope,Normalized Differentiate Vegetation Index(NDVI),drainage network,curvature,Land Use Land Cover(LULC),rainfall,lithology,contour,soil texture,and road network)of soil erosion using GIS platform.Additionally,GIS-based statistical models like the weight of evidence(WOE)and frequency ratio(FR)were applied to produce the SESM for the study area.The SESM was reclassified into four classes,i.e.,low,medium,high,and very high zone.The results of WOE for SESM show that 16.39%,33.02%,29.27%,and 21.30%of areas are covered by low,medium,high,and very high zones,respectively.In contrast,the FR results revealed that 16.50%,24.33%,35.55%,and 23.59%of the areas are occupied by low,medium,high,and very high classes.Furthermore,the reliability of applied models was evaluated using the Area Under Curve(AUC)technique.The validation results utilizing the area under curve showed that the success rate curve(SRC)and predicted rate curve(PRC)for WOE are 82%and 86%,respectively,while SRC and PRC for FR are 85%and 96%,respectively.The validation results revealed that the FR model performance is better and more reliable than the WOE.
文摘Sunshine duration (S) based empirical equations have been employed in this study to estimate the daily global solar radiation on a horizontal surface (G) for six meteorological stations in Burundi. Those equations include the Ångström-Prescott linear model and four amongst its derivatives, i.e. logarithmic, exponential, power and quadratic functions. Monthly mean values of daily global solar radiation and sunshine duration data for a period of 20 to 23 years, from the Geographical Institute of Burundi (IGEBU), have been used. For any of the six stations, ten single or double linear regressions have been developed from the above-said five functions, to relate in terms of monthly mean values, the daily clearness index () to each of the next two kinds of relative sunshine duration (RSD): and . In those ratios, G<sub>0</sub>, S<sub>0 </sub>and stand for the extraterrestrial daily solar radiation on a horizontal surface, the day length and the modified day length taking into account the natural site’s horizon, respectively. According to the calculated mean values of the clearness index and the RSD, each station experiences a high number of fairly clear (or partially cloudy) days. Estimated values of the dependent variable (y) in each developed linear regression, have been compared to measured values in terms of the coefficients of correlation (R) and of determination (R<sub>2</sub>), the mean bias error (MBE), the root mean square error (RMSE) and the t-statistics. Mean values of these statistical indicators have been used to rank, according to decreasing performance level, firstly the ten developed equations per station on account of the overall six stations, secondly the six stations on account of the overall ten equations. Nevertheless, the obtained values of those indicators lay in the next ranges for all the developed sixty equations:;;;, with . These results lead to assert that any of the sixty developed linear regressions (and thus equations in terms of and ), fits very adequately measured data, and should be used to estimate monthly average daily global solar radiation with sunshine duration for the relevant station. It is also found that using as RSD, is slightly more advantageous than using for estimating the monthly average daily clearness index, . Moreover, values of statistical indicators of this study match adequately data from other works on the same kinds of empirical equations.
文摘Statistical literacy is crucial for cultivating well-rounded thinkers.The integration of evidence-based strategies in teaching and learning is pivotal for enhancing students’statistical literacy.This research specifically focuses on the utilization of Share and Model Concepts and Nurturing Metacognition as evidence-based strategies aimed at improving the statistical literacy of learners.The study employed a quasi-experimental design,specifically the nonequivalent control group,wherein students answered pre-test and post-test instruments and researcher-made questionnaires.The study included 50 first-year Bachelor in Secondary Education majors in Mathematics and Science for the academic year 2023-2024.The results of the study revealed a significant difference in the scores of student respondents,indicating that the use of evidence-based strategies helped students enhance their statistical literacy.This signifies a noteworthy increase in their performance,ranging from very low to very high proficiency in understanding statistical concepts,insights into the application of statistical concepts,numeracy,graph skills,interpretation capabilities,and visualization and communication skills.Furthermore,the study showed a significant difference in the post-test scores’performance of the two groups in understanding statistical concepts and visualization and communication skills.However,no significant difference was found in the post-test scores of the two groups concerning insights into the application of statistical concepts,numeracy and graph skills,and interpretation capabilities.Additionally,students acknowledged that the implementation of evidence-based strategies significantly contributed to the improvement of their statistical literacy.
文摘Recently,gender equality and women’s entrepreneurship have gained considerable attention in global economic development.Prior to the design of any policy interventions to increase women’s entrepreneurship,it is significant to comprehend the factors motivating women to become entrepreneurs.The non-understanding of the factors can result in the endurance of low living stan-dards and the design of expensive and ineffectual policies.But female involve-ment in entrepreneurship becomes higher in developing economies compared to developed economies.Women Entrepreneurship Index(WEI)plays a vital role in determining the factors that enable theflourishment of high potential female entrepreneurs which enhances economic welfare and contributes to the economic and social fabric of society.Therefore,it is needed to design an automated and accurate WEI prediction model to improve women’s entrepreneurship.In this view,this article develops an automated statistical analysis enabled WEI predic-tive(ASA-WEIP)model.The proposed ASA-WEIP technique aims to effectually determine the WEI.The proposed ASA-WEIP technique encompasses a series of sub-processes such as pre-processing,WEI prediction,and parameter optimiza-tion.For the prediction of WEI,the ASA-WEIP technique makes use of the Deep Belief Network(DBN)model,and the parameter optimization process takes place using Squirrel Search Algorithm(SSA).The performance validation of the ASA-WEIP technique was executed using the benchmark dataset from the Kaggle repo-sitory.The experimental outcomes stated the better outcomes of the ASA-WEIP technique over the other existing techniques.
基金supported in part by the Fundamental Research Funds for the Central Universities under Grant No.2022JBXT001in part by NS⁃FC under Grant No.62171021+1 种基金in part by the Project of China State Rail⁃way Group under Grant No.P2021G012in part by ZTE Industry⁃University⁃Institute Cooperation Funds under Grant No.I21L00220.
文摘The railway mobile communication system is undergoing a smooth transition from the Global System for Mobile Communications-Railway(GSM-R)to the Railway 5G.In this paper,an empirical path loss model based on a large amount of measured data is established to predict the path loss in the Railway 5G marshalling yard scenario.According to the different characteristics of base station directional antennas,the antenna gain is verified.Then we propose the position of the breakpoint in the antenna propagation area,and based on the breakpoint segmentation,a large-scale statistical model for marshalling yards is established.
基金the projects ‘‘The risk assessment of geological hazards induced by reservoir water level fluctuation in Chongqing, Three-Gorges Reservoir, China.’’ (No. 2016065135)‘‘The study of mechanism and forecast criterion of the gentle-dip landslides in The Three Gorges Reservoir Region, China’’ (No. 41572292) funded by the National Natural Science Foundation of China
文摘Landslide susceptibility mapping is vital for landslide risk management and urban planning.In this study,we used three statistical models[frequency ratio,certainty factor and index of entropy(IOE)]and a machine learning model[random forest(RF)]for landslide susceptibility mapping in Wanzhou County,China.First,a landslide inventory map was prepared using earlier geotechnical investigation reports,aerial images,and field surveys.Then,the redundant factors were excluded from the initial fourteen landslide causal factors via factor correlation analysis.To determine the most effective causal factors,landslide susceptibility evaluations were performed based on four cases with different combinations of factors("cases").In the analysis,465(70%)landslide locations were randomly selected for model training,and 200(30%)landslide locations were selected for verification.The results showed that case 3 produced the best performance for the statistical models and that case 2 produced the best performance for the RF model.Finally,the receiver operating characteristic(ROC)curve was used to verify the accuracy of each model's results for its respective optimal case.The ROC curve analysis showed that the machine learning model performed better than the other three models,and among the three statistical models,the IOE model with weight coefficients was superior.
基金the bread wheat project of the Dryland Agricultural Research Institute (DARI)supported by the Agricultural Research and Education Organization (AREO) of Iran
文摘Several statistical methods have been developed for analyzing genotype×environment(GE)interactions in crop breeding programs to identify genotypes with high yield and stability performances.Four statistical methods,including joint regression analysis(JRA),additive mean effects and multiplicative interaction(AMMI)analysis,genotype plus GE interaction(GGE)biplot analysis,and yield–stability(YSi)statistic were used to evaluate GE interaction in20 winter wheat genotypes grown in 24 environments in Iran.The main objective was to evaluate the rank correlations among the four statistical methods in genotype rankings for yield,stability and yield–stability.Three kinds of genotypic ranks(yield ranks,stability ranks,and yield–stability ranks)were determined with each method.The results indicated the presence of GE interaction,suggesting the need for stability analysis.With respect to yield,the genotype rankings by the GGE biplot and AMMI analysis were significantly correlated(P<0.01).For stability ranking,the rank correlations ranged from 0.53(GGE–YSi;P<0.05)to0.97(JRA–YSi;P<0.01).AMMI distance(AMMID)was highly correlated(P<0.01)with variance of regression deviation(S2di)in JRA(r=0.83)and Shukla stability variance(σ2)in YSi(r=0.86),indicating that these stability indices can be used interchangeably.No correlation was found between yield ranks and stability ranks(AMMID,S2di,σ2,and GGE stability index),indicating that they measure static stability and accordingly could be used if selection is based primarily on stability.For yield–stability,rank correlation coefficients among the statistical methods varied from 0.64(JRA–YSi;P<0.01)to 0.89(AMMI–YSi;P<0.01),indicating that AMMI and YSi were closely associated in the genotype ranking for integrating yield with stability performance.Based on the results,it can be concluded that YSi was closely correlated with(i)JRA in ranking genotypes for stability and(ii)AMMI for integrating yield and stability.
基金Supported by the High Technology Research and Development Program of China (863 Program,No2006AA100301)
文摘The performance of six statistical approaches,which can be used for selection of the best model to describe the growth of individual fish,was analyzed using simulated and real length-at-age data.The six approaches include coefficient of determination(R2),adjusted coefficient of determination(adj.-R2),root mean squared error(RMSE),Akaike's information criterion(AIC),bias correction of AIC(AICc) and Bayesian information criterion(BIC).The simulation data were generated by five growth models with different numbers of parameters.Four sets of real data were taken from the literature.The parameters in each of the five growth models were estimated using the maximum likelihood method under the assumption of the additive error structure for the data.The best supported model by the data was identified using each of the six approaches.The results show that R2 and RMSE have the same properties and perform worst.The sample size has an effect on the performance of adj.-R2,AIC,AICc and BIC.Adj.-R2 does better in small samples than in large samples.AIC is not suitable to use in small samples and tends to select more complex model when the sample size becomes large.AICc and BIC have best performance in small and large sample cases,respectively.Use of AICc or BIC is recommended for selection of fish growth model according to the size of the length-at-age data.
文摘Based on the review and comparison of main statistical analysis models for estimating variety-environment cell means in regional crop trials, a new statistical model, LR-PCA composite model was proposed, and the predictive precision of these models were compared by cross validation of an example data. Results showed that the order of model precision was LR-PCA model > AMMI model > PCA model > Treatment Means (TM) model > Linear Regression (LR) model > Additive Main Effects ANOVA model. The precision gain factor of LR-PCA model was 1.55, increasing by 8.4% compared with AMMI.
基金Project(61620106002)supported by the National Natural Science Foundation of ChinaProject(2016YFB0100906)supported by the National Key R&D Program in China+1 种基金Project(2015364X16030)supported by the Information Technology Research Project of Ministry of Transport of ChinaProject(2242015K42132)supported by the Fundamental Sciences of Southeast University,China
文摘This work correlated the detailed work zone location and time data from the Wis LCS system with the five-min inductive loop detector data. One-sample percentile value test and two-sample Kolmogorov-Smirnov(K-S) test were applied to compare the speed and flow characteristics between work zone and non-work zone conditions. Furthermore, we analyzed the mobility characteristics of freeway work zones within the urban area of Milwaukee, WI, USA. More than 50% of investigated work zones have experienced speed reduction and 15%-30% is necessary reduced volumes. Speed reduction was more significant within and at the downstream of work zones than at the upstream.
基金supported by the National High Technology Research and Development Program of China (Grant No. 2010AA101301)the Program of Introducing International Advanced Agricultural Science and Technology in China (Grant No. 2006-G8[4]-31-1)the Program of Science-Technology Basis and Conditional Platform in China (Grant No. 505005)
文摘QTL mapping for seven quality traits was conducted by using 254 recombinant inbred lines (RIL) derived from a japonica-japonica rice cross of Xiushui 79/C Bao. The seven traits investigated were grain length (GL), grain length to width ratio (LWR), chalk grain rate (CGR), chalkiness degree (CD), gelatinization temperature (GT), amylose content (AC) and gel consistency (GC) of head rice. Three mapping methods employed were composite interval mapping in QTLMapper 2.0 software based on mixed linear model (MCIM), inclusive composite interval mapping in QTL IciMapping 3.0 software based on stepwise regression linear model (ICIM) and multiple interval mapping with regression forward selection in Windows QTL Cartographer 2.5 based on multiple regression analysis (MIMR). Results showed that five QTLs with additive effect (A-QTLs) were detected by all the three methods simultaneously, two by two methods simultaneously, and 23 by only one method. Five A-QTLs were detected by MCIM, nine by ICIM and 28 by MIMR. The contribution rates of single A-QTL ranged from 0.89% to 38.07%. All the QTLs with epistatic effect (E-QTLs) detected by MIMR were not detected by the other two methods. Fourteen pairs of E-QTLs were detected by both MCIM and ICIM, and 142 pairs of E-QTLs were detected by only one method. Twenty-five pairs of E-QTLs were detected by MCIM, 141 pairs by ICIM and four pairs by MIMR. The contribution rates of single pair of E-QTL were from 2.60% to 23.78%. In the Xiu-Bao RIL population, epistatic effect played a major role in the variation of GL and CD, and additive effect was the dominant in the variation of LWR, while both epistatic effect and additive effect had equal importance in the variation of CGR, AC, GT and GC. QTLs detected by two or more methods simultaneously were highly reliable, and could be applied to improve the quality traits in japonica hybrid rice.
文摘The water resources of the Nadhour-Sisseb-El Alem Basin in Tunisia exhibit semi-arid and arid climatic conditions.This induces an excessive pumping of groundwater,which creates drops in water level ranging about 1-2 m/a.Indeed,these unfavorable conditions require interventions to rationalize integrated management in decision making.The aim of this study is to determine a water recharge index(WRI),delineate the potential groundwater recharge area and estimate the potential groundwater recharge rate based on the integration of statistical models resulted from remote sensing imagery,GIS digital data(e.g.,lithology,soil,runoff),measured artificial recharge data,fuzzy set theory and multi-criteria decision making(MCDM)using the analytical hierarchy process(AHP).Eight factors affecting potential groundwater recharge were determined,namely lithology,soil,slope,topography,land cover/use,runoff,drainage and lineaments.The WRI is between 1.2 and 3.1,which is classified into five classes as poor,weak,moderate,good and very good sites of potential groundwater recharge area.The very good and good classes occupied respectively 27%and 44%of the study area.The potential groundwater recharge rate was 43%of total precipitation.According to the results of the study,river beds are favorable sites for groundwater recharge.
文摘Forecasting the movement of stock market is a long-time attractive topic. This paper implements different statistical learning models to predict the movement of S&P 500 index. The S&P 500 index is influenced by other important financial indexes across the world such as commodity price and financial technical indicators. This paper systematically investigated four supervised learning models, including Logistic Regression, Gaussian Discriminant Analysis (GDA), Naive Bayes and Support Vector Machine (SVM) in the forecast of S&P 500 index. After several experiments of optimization in features and models, especially the SVM kernel selection and feature selection for different models, this paper concludes that a SVM model with a Radial Basis Function (RBF) kernel can achieve an accuracy rate of 62.51% for the future market trend of the S&P 500 index.
文摘Following the basic principle of modem multivariate statistical analysis theory, the description model, prediction model and control model to relate chemical compositions and mechanical properties of steels are introduced. As an example, the total flowchart of components and structure/properties description, prediction and control model for chemical composition and mechanical properties of 20 and A_2 steel are presented.
基金The work was supported by the Foundation of State Education Committee of China
文摘Semi-rigid liquid crystal polymer is a class of liquid crystal polymers different from long rigid rod liquid crystal polymer to which the well-known Onsager and Flory theories are applied. In this paper, three statistical models for the semi-rigid nematic polymer were addressed. They are the elastically jointed rod model, worm-like chain model, and non-homogeneous chain model. The nematic-isotropic transition temperature was examined. The pseudo-second transition temperature is expressed analytically. Comparisons with the experiments were made and the agreements were found.
文摘This paper describes the stochastic model of the scattered electromagnetic field.Unlike common_used functional_determined models the proposed is characterised by amplitude/phase fluctuation of the received signal.This paper derives the statistical characteristic of the input signal and describes algorithm for its estimation in post_processing and real_time processing modes.Achieved characteristics allow the mapping and estimation of the surface models more accurate,moreover,such processing increase space resolution of synthetic aperture radar.
文摘The objective of this study is to analyze the sensitivity of the statistical models regarding the size of samples. The study carried out in Ivory Coast is based on annual maximum daily rainfall data collected from 26 stations. The methodological approach is based on the statistical modeling of maximum daily rainfall. Adjustments were made on several sample sizes and several return periods (2, 5, 10, 20, 50 and 100 years). The main results have shown that the 30 years series (1931-1960;1961-1990;1991-2020) are better adjusted by the Gumbel (26.92% - 53.85%) and Inverse Gamma (26.92% - 46.15%). Concerning the 60-years series (1931-1990;1961-2020), they are better adjusted by the Inverse Gamma (30.77%), Gamma (15.38% - 46.15%) and Gumbel (15.38% - 42.31%). The full chronicle 1931-2020 (90 years) presents a notable supremacy of 50% of Gumbel model over the Gamma (34.62%) and Gamma Inverse (15.38%) model. It is noted that the Gumbel is the most dominant model overall and more particularly in wet periods. The data for periods with normal and dry trends were better fitted by Gamma and Inverse Gamma.