The subset threshold auto regressive (SSTAR) model, which is capable of reproducing the limit cycle behavior of nonlinear time series, is introduced. The algorithm for fitting the sampled data with SSTAR model is pr...The subset threshold auto regressive (SSTAR) model, which is capable of reproducing the limit cycle behavior of nonlinear time series, is introduced. The algorithm for fitting the sampled data with SSTAR model is proposed and applied to model and forecast power load. Numerical example verifies that desirable accuracy of short term load forecasting can be achieved by using the SSTAR model.展开更多
To accurately model flows with shock waves using staggered-grid Lagrangian hydrodynamics, the artificial viscosity has to be introduced to convert kinetic energy into internal energy, thereby increasing the entropy ac...To accurately model flows with shock waves using staggered-grid Lagrangian hydrodynamics, the artificial viscosity has to be introduced to convert kinetic energy into internal energy, thereby increasing the entropy across shocks. Determining the appropriate strength of the artificial viscosity is an art and strongly depends on the particular problem and experience of the researcher. The objective of this study is to pose the problem of finding the appropriate strength of the artificial viscosity as an optimization problem and solve this problem using machine learning (ML) tools, specifically using surrogate models based on Gaussian Process regression (GPR) and Bayesian analysis. We describe the optimization method and discuss various practical details of its implementation. The shock-containing problems for which we apply this method all have been implemented in the LANL code FLAG (Burton in Connectivity structures and differencing techniques for staggered-grid free-Lagrange hydrodynamics, Tech. Rep. UCRL-JC-110555, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1992, in Consistent finite-volume discretization of hydrodynamic conservation laws for unstructured grids, Tech. Rep. CRL-JC-118788, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1994, Multidimensional discretization of conservation laws for unstructured polyhedral grids, Tech. Rep. UCRL-JC-118306, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1994, in FLAG, a multi-dimensional, multiple mesh, adaptive free-Lagrange, hydrodynamics code. In: NECDC, 1992). First, we apply ML to find optimal values to isolated shock problems of different strengths. Second, we apply ML to optimize the viscosity for a one-dimensional (1D) propagating detonation problem based on Zel’dovich-von Neumann-Doring (ZND) (Fickett and Davis in Detonation: theory and experiment. Dover books on physics. Dover Publications, Mineola, 2000) detonation theory using a reactive burn model. We compare results for default (currently used values in FLAG) and optimized values of the artificial viscosity for these problems demonstrating the potential for significant improvement in the accuracy of computations.展开更多
Carbon emissions have become a critical concern in the global effort to combat climate change,with each country or region contributing differently based on its economic structures,energy sources,and industrial activit...Carbon emissions have become a critical concern in the global effort to combat climate change,with each country or region contributing differently based on its economic structures,energy sources,and industrial activities.The factors influencing carbon emissions vary across countries and sectors.This study examined the factors influencing CO_(2)emissions in the 7 South American countries including Argentina,Brazil,Chile,Colombia,Ecuador,Peru,and Venezuela.We used the Seemingly Unrelated Regression(SUR)model to analyse the relationship of CO_(2)emissions with gross domestic product(GDP),renewable energy use,urbanization,industrialization,international tourism,agricultural productivity,and forest area based on data from 2000 to 2022.According to the SUR model,we found that GDP and industrialization had a moderate positive effect on CO_(2)emissions,whereas renewable energy use had a moderate negative effect on CO_(2)emissions.International tourism generally had a positive impact on CO_(2)emissions,while forest area tended to decrease CO_(2)emissions.Different variables had different effects on CO_(2)emissions in the 7 South American countries.In Argentina and Venezuela,GDP,international tourism,and agricultural productivity significantly affected CO_(2)emissions.In Colombia,GDP and international tourism had a negative impact on CO_(2)emissions.In Brazil,CO_(2)emissions were primarily driven by GDP,while in Chile,Ecuador,and Peru,international tourism had a negative effect on CO_(2)emissions.Overall,this study highlights the importance of country-specific strategies for reducing CO_(2)emissions and emphasizes the varying roles of these driving factors in shaping environmental quality in the 7 South American countries.展开更多
BACKGROUND Congenital heart disease is most commonly seen in neonates and it is a major cause of pediatric illness and childhood morbidity and mortality.AIM To identify and build the best predictive model for predicti...BACKGROUND Congenital heart disease is most commonly seen in neonates and it is a major cause of pediatric illness and childhood morbidity and mortality.AIM To identify and build the best predictive model for predicting cyanotic and acyanotic congenital heart disease in children during pregnancy and identify their potential risk factors.METHODS The data were collected from the Pediatric Cardiology Department at Chaudhry Pervaiz Elahi Institute of Cardiology Multan,Pakistan from December 2017 to October 2019.A sample of 3900 mothers whose children were diagnosed with identify the potential outliers.Different machine learning models were compared,and the best-fitted model was selected using the area under the curve,sensitivity,and specificity of the models.RESULTS Out of 3900 patients included,about 69.5%had acyanotic and 30.5%had cyanotic congenital heart disease.Males had more cases of acyanotic(53.6%)and cyanotic(54.5%)congenital heart disease as compared to females.The odds of having cyanotic was 1.28 times higher for children whose mothers used more fast food frequently during pregnancy.The artificial neural network model was selected as the best predictive model with an area under the curve of 0.9012,sensitivity of 65.76%,and specificity of 97.23%.CONCLUSION Children having a positive family history are at very high risk of having cyanotic and acyanotic congenital heart disease.Males are more at risk and their mothers need more care,good food,and physical activity during pregnancy.The best-fitted model for predicting cyanotic and acyanotic congenital heart disease is the artificial neural network.The results obtained and the best model identified will be useful for medical practitioners and public health scientists for an informed decision-making process about the earlier diagnosis and improve the health condition of children in Pakistan.展开更多
This study aims to analyze and predict the relationship between the average price per box in the cigarette market of City A and government procurement,providing a scientific basis and support for decision-making.By re...This study aims to analyze and predict the relationship between the average price per box in the cigarette market of City A and government procurement,providing a scientific basis and support for decision-making.By reviewing relevant theories and literature,qualitative prediction methods,regression prediction models,and other related theories were explored.Through the analysis of annual cigarette sales data and government procurement data in City A,a comprehensive understanding of the development of the tobacco industry and the economic trends of tobacco companies in the county was obtained.By predicting and analyzing the average price per box of cigarette sales across different years,corresponding prediction results were derived and compared with actual sales data.The prediction results indicate that the correlation coefficient between the average price per box of cigarette sales and government procurement is 0.982,implying that government procurement accounts for 96.4%of the changes in the average price per box of cigarettes.These findings offer an in-depth exploration of the relationship between the average price per box of cigarettes in City A and government procurement,providing a scientific foundation for corporate decision-making and market operations.展开更多
Municipal solid waste generation is strongly linked to rising human population and expanding urban areas, with significant implications on urban metabolism as well as space and place values redefinition. Effective man...Municipal solid waste generation is strongly linked to rising human population and expanding urban areas, with significant implications on urban metabolism as well as space and place values redefinition. Effective management performance of municipal solid waste management underscores the interdisciplinarity strategies. Such knowledge and skills are paramount to uncover the sources of waste generation as well as means of waste storage, collection, recycling, transportation, handling/treatment, disposal, and monitoring. This study was conducted in Dar es Salaam city. Driven by the curiosity model of the solid waste minimization performance at source, study data was collected using focus group discussion techniques to ward-level local government officers, which was triangulated with literature and documentary review. The main themes of the FGD were situational factors (SFA) and local government by-laws (LGBY). In the FGD session, sub-themes of SFA tricked to understand how MSW minimization is related to the presence and effect of services such as land use planning, availability of landfills, solid waste transfer stations, material recovery facilities, incinerators, solid waste collection bins, solid waste trucks, solid waste management budget and solid waste collection agents. Similarly, FGD on LGBY was extended by sub-themes such as contents of the by-law, community awareness of the by-law, and by-law enforcement mechanisms. While data preparation applied an analytical hierarchy process, data analysis applied an ordinary least square (OLS) regression model for sub-criteria that explain SFA and LGBY;and OLS standard residues as variables into geographically weighted regression with a resolution of 241 × 241 meter in ArcMap v10.5. Results showed that situational factors and local government by-laws have a strong relationship with the rate of minimizing solid waste dumping in water bodies (local R square = 0.94).展开更多
Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear mode...Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear model is the most used technique for identifying hidden relationships between underlying random variables of interest. However, data quality is a significant challenge in machine learning, especially when missing data is present. The linear regression model is a commonly used statistical modeling technique used in various applications to find relationships between variables of interest. When estimating linear regression parameters which are useful for things like future prediction and partial effects analysis of independent variables, maximum likelihood estimation (MLE) is the method of choice. However, many datasets contain missing observations, which can lead to costly and time-consuming data recovery. To address this issue, the expectation-maximization (EM) algorithm has been suggested as a solution for situations including missing data. The EM algorithm repeatedly finds the best estimates of parameters in statistical models that depend on variables or data that have not been observed. This is called maximum likelihood or maximum a posteriori (MAP). Using the present estimate as input, the expectation (E) step constructs a log-likelihood function. Finding the parameters that maximize the anticipated log-likelihood, as determined in the E step, is the job of the maximization (M) phase. This study looked at how well the EM algorithm worked on a made-up compositional dataset with missing observations. It used both the robust least square version and ordinary least square regression techniques. The efficacy of the EM algorithm was compared with two alternative imputation techniques, k-Nearest Neighbor (k-NN) and mean imputation (), in terms of Aitchison distances and covariance.展开更多
Under-fitting problems usually occur in regression models for dam safety monitoring.To overcome the local convergence of the regression, a genetic algorithm (GA) was proposed using a real parameter coding, a ranking s...Under-fitting problems usually occur in regression models for dam safety monitoring.To overcome the local convergence of the regression, a genetic algorithm (GA) was proposed using a real parameter coding, a ranking selection operator, an arithmetical crossover operator and a uniform mutation operator, and calculated the least-square error of the observed and computed values as its fitness function. The elitist strategy was used to improve the speed of the convergence. After that, the modified genetic algorithm was applied to reassess the coefficients of the regression model and a genetic regression model was set up. As an example, a slotted gravity dam in the Northeast of China was introduced. The computational results show that the genetic regression model can solve the under-fitting problems perfectly.展开更多
Based on modeling principle of GM(1,1)model and linear regression model,a combined prediction model is established to predict equipment fault by the fitting of two models.The new prediction model takes full advantag...Based on modeling principle of GM(1,1)model and linear regression model,a combined prediction model is established to predict equipment fault by the fitting of two models.The new prediction model takes full advantage of prediction information provided by the two models and improves the prediction precision.Finally,this model is introduced to predict the system fault time according to the output voltages of a certain type of radar transmitter.展开更多
A fuzzy observations-based radial basis function neural network (FORBFNN) is presented for modeling nonlinear systems in which the observations of response are imprecise but can be represented as fuzzy membership fu...A fuzzy observations-based radial basis function neural network (FORBFNN) is presented for modeling nonlinear systems in which the observations of response are imprecise but can be represented as fuzzy membership functions. In the FORBFNN model, the weight coefficients of nodes in the hidden layer are identified by using the fuzzy expectation-maximization ( EM ) algorithm, whereas the optimal number of these nodes as well as the centers and widths of radial basis functions are automatically constructed by using a data-driven method. Namely, the method starts with an initial node, and then a new node is added in a hidden layer according to some rules. This procedure is not terminated until the model meets the preset requirements. The method considers both the accuracy and complexity of the model. Numerical simulation results show that the modeling method is effective, and the established model has high prediction accuracy.展开更多
The aim of this study was to assay the polyphenols,flavonoid,polyphenol oxidase and phenylalnine ammonialyase which were relative to the anthocyanins synthesis of purple corn. The optimization of multiple linear regre...The aim of this study was to assay the polyphenols,flavonoid,polyphenol oxidase and phenylalnine ammonialyase which were relative to the anthocyanins synthesis of purple corn. The optimization of multiple linear regression model of anthocyanins synthesis was y=4.383 86-0.205 45x1+5.479 638x2+0.195 575x4. According to standard partial regression coefficient testing,the result indicated that polyphenols content was negatively correlated with anthocyanins and the relative influence to anthocyanins synthesis was-42.7%; flavonoid content and activity of polyphenol oxidase were positively correlated with anthocyanins of purple corn and the relative influence to anthocyanins synthesis were 71.45% and 73.32% respectively. There was no positive correlation between the activity of phenylalnine ammonialyase and anthocyanins of purple corn. The establishment of multiple linear regression model of anthocyanins synthesis was to provide theory foundation of producing anthocyanins in laboratory.展开更多
Amyotrophic lateral sclerosis is a rare neurodegenerative disease characterized by the involvement of both upper and lower motor neurons.Early bilateral limb involvement significantly affects patients'daily lives ...Amyotrophic lateral sclerosis is a rare neurodegenerative disease characterized by the involvement of both upper and lower motor neurons.Early bilateral limb involvement significantly affects patients'daily lives and may lead them to be confined to bed.However,the effect of upper and lower motor neuron impairment and other risk factors on bilateral limb involvement is unclear.To address this issue,we retrospectively collected data from 586 amyotrophic lateral sclerosis patients with limb onset diagnosed at Peking University Third Hospital between January 2020 and May 2022.A univariate analysis revealed no significant differences in the time intervals of spread in different directions between individuals with upper motor neuron-dominant amyotrophic lateral sclerosis and those with classic amyotrophic lateral sclerosis.We used causal directed acyclic graphs for risk factor determination and Cox proportional hazards models to investigate the association between the duration of bilateral limb involvement and clinical baseline characteristics in amyotrophic lateral sclerosis patients.Multiple factor analyses revealed that higher upper motor neuron scores(hazard ratio[HR]=1.05,95%confidence interval[CI]=1.01–1.09,P=0.018),onset in the left limb(HR=0.72,95%CI=0.58–0.89,P=0.002),and a horizontal pattern of progression(HR=0.46,95%CI=0.37–0.58,P<0.001)were risk factors for a shorter interval until bilateral limb involvement.The results demonstrated that a greater degree of upper motor neuron involvement might cause contralateral limb involvement to progress more quickly in limb-onset amyotrophic lateral sclerosis patients.These findings may improve the management of amyotrophic lateral sclerosis patients with limb onset and the prediction of patient prognosis.展开更多
In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood e...In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood estimation is considered. Three diagnostic statistics are used to detect whether the outliers exist in the data set. Simulation results show that when the sample size is small, the values of diagnostic statistics based on the maximum Lq-likelihood estimation are greater than the values based on the maximum likelihood estimation. As the sample size increases, the difference between the values of the diagnostic statistics based on two estimation methods diminishes gradually. It means that the outliers can be distinguished easier through the maximum Lq-likelihood method than those through the maximum likelihood estimation method.展开更多
In this paper, based on the theory of parameter estimation, we give a selection method and, in a sense of a good character of the parameter estimation, we think that it is very reasonable. Moreover, we offer a calcula...In this paper, based on the theory of parameter estimation, we give a selection method and, in a sense of a good character of the parameter estimation, we think that it is very reasonable. Moreover, we offer a calculation method of selection statistic and an applied example.展开更多
The radial basis function (RBF) emerged as a variant of artificial neural network. Generalized regression neural network (GRNN) is one type of RBF, and its principal advantages are that it can quickly learn and ra...The radial basis function (RBF) emerged as a variant of artificial neural network. Generalized regression neural network (GRNN) is one type of RBF, and its principal advantages are that it can quickly learn and rapidly converge to the optimal regression surface with large number of data sets. Hyperspectral reflectance (350 to 2500 nm) data were recorded at two different rice sites in two experiment fields with two cultivars, three nitrogen treatments and one plant density (45 plants m^-2). Stepwise multivariable regression model (SMR) and RBF were used to compare their predictability for the leaf area index (LAI) and green leaf chlorophyll density (GLCD) of rice based on reflectance (R) and its three different transformations, the first derivative reflectance (D1), the second derivative reflectance (D2) and the log-transformed reflectance (LOG). GRNN based on D1 was the best model for the prediction of rice LAI and CLCD. The relationships between different transformations of reflectance and rice parameters could be further improved when RBF was employed. Owing to its strong capacity for nonlinear mapping and good robustness, GRNN could maximize the sensitivity to chlorophyll content using D1. It is concluded that RBF may provide a useful exploratory and predictive tool for the estimation of rice biophysical parameters.展开更多
Landslide susceptibility mapping is significant for landslide prevention.Many approaches have been used for landslide susceptibility prediction,however,their performances are unstable.This study constructed a hybrid m...Landslide susceptibility mapping is significant for landslide prevention.Many approaches have been used for landslide susceptibility prediction,however,their performances are unstable.This study constructed a hybrid model,namely box counting dimension-based kernel logistic regression model,which uses fractal dimension calculated by box counting method as input data based on grid cells mapping unit and terrain mapping unit.The performance of this model was evaluated in the application in Zhidan County,Shaanxi Province,China.Firstly,a total of 221 landslides were identified and mapped,and 11 landslide predisposing factors were considered.Secondly,the landslide susceptibility maps(LSMs) of the study area were obtained by constructing the model on two different mapping units.Finally,the results were evaluated with five statistical indexes,sensitivity,specificity,positive predictive value(PPV),negative predictive value(NPV) and Accuracy.The statistical indexes of the model obtained on the terrain mapping unit were larger than those based on grid cells mapping unit.For training and validation datasets,the area under the receiver operating characteristic curve(AUC) of the model based on terrain mapping unit were 0.9374 and 0.9527,respectively,indicating that establishing this model on the terrain mapping unit was advantageous in the study area.The results show that the fractal dimension improves the prediction ability of the kernel logistic model.In addition,the terrain mapping unit is a more promising mapping unit in Loess areas.展开更多
In this paper we apply the nonlinear time series analysis method to small-time scale traffic measurement data. The prediction-based method is used to determine the embedding dimension of the traffic data. Based on the...In this paper we apply the nonlinear time series analysis method to small-time scale traffic measurement data. The prediction-based method is used to determine the embedding dimension of the traffic data. Based on the reconstructed phase space, the local support vector machine prediction method is used to predict the traffic measurement data, and the BIC-based neighbouring point selection method is used to choose the number of the nearest neighbouring points for the local support vector machine regression model. The experimental results show that the local support vector machine prediction method whose neighbouring points are optimized can effectively predict the small-time scale traffic measurement data and can reproduce the statistical features of real traffic measurements.展开更多
Backgrounds:Evaluating the growth performance of pigs in real-time is laborious and expensive,thus mathematical models based on easily accessible variables are developed.Multiple regression(MR)is the most widely used ...Backgrounds:Evaluating the growth performance of pigs in real-time is laborious and expensive,thus mathematical models based on easily accessible variables are developed.Multiple regression(MR)is the most widely used tool to build prediction models in swine nutrition,while the artificial neural networks(ANN)model is reported to be more accurate than MR model in prediction performance.Therefore,the potential of ANN models in predicting the growth performance of pigs was evaluated and compared with MR models in this study.Results:Body weight(BW),net energy(NE)intake,standardized ileal digestible lysine(SID Lys)intake,and their quadratic terms were selected as input variables to predict ADG and F/G among 10 candidate variables.In the training phase,MR models showed high accuracy in both ADG and F/G prediction(R^(2)_(ADG)=0.929,R^(2)_(F/G)=0.886)while ANN models with 4,6 neurons and radial basis activation function yielded the best performance in ADG and F/G prediction(R^(2)_(ADG)=0.964,R^(2)_(F/G)=0.932).In the testing phase,these ANN models showed better accuracy in ADG prediction(CCC:0.976 vs.0.861,R^(2):0.951 vs.0.584),and F/G prediction(CCC:0.952 vs.0.900,R^(2):0.905 vs.0.821)compared with the MR models.Meanwhile,the“over-fitting”occurred in MR models but not in ANN models.On validation data from the animal trial,ANN models exhibited superiority over MR models in both ADG and F/G prediction(P<0.01).Moreover,the growth stages have a significant effect on the prediction accuracy of the models.Conclusion:Body weight,NE intake and SID Lys intake can be used as input variables to predict the growth performance of growing-finishing pigs,with trained ANN models are more flexible and accurate than MR models.Therefore,it is promising to use ANN models in related swine nutrition studies in the future.展开更多
Background: Leaf Area Index(LAI) is an important parameter used in monitoring and modeling of forest ecosystems. The aim of this study was to evaluate performance of the artificial neural network(ANN) models to predic...Background: Leaf Area Index(LAI) is an important parameter used in monitoring and modeling of forest ecosystems. The aim of this study was to evaluate performance of the artificial neural network(ANN) models to predict the LAI by comparing the regression analysis models as the classical method in these pure and even-aged Crimean pine forest stands.Methods: One hundred eight temporary sample plots were collected from Crimean pine forest stands to estimate stand parameters. Each sample plot was imaged with hemispherical photographs to detect the LAI. The partial correlation analysis was used to assess the relationships between the stand LAI values and stand parameters, and the multivariate linear regression analysis was used to predict the LAI from stand parameters. Different artificial neural network models comprising different number of neuron and transfer functions were trained and used to predict the LAI of forest stands.Results: The correlation coefficients between LAI and stand parameters(stand number of trees, basal area, the quadratic mean diameter, stand density and stand age) were significant at the level of 0.01. The stand age, number of trees, site index, and basal area were independent parameters in the most successful regression model predicted LAI values using stand parameters(R_(adj)~2=0.5431). As corresponding method to predict the interactions between the stand LAI values and stand parameters, the neural network architecture based on the RBF 4-19-1 with Gaussian activation function in hidden layer and the identity activation function in output layer performed better in predicting LAI(SSE(12.1040), MSE(0.1223), RMSE(0.3497), AIC(0.1040), BIC(-77.7310) and R^2(0.6392)) compared to the other studied techniques.Conclusion: The ANN outperformed the multivariate regression techniques in predicting LAI from stand parameters. The ANN models, developed in this study, may aid in making forest management planning in study forest stands.展开更多
A geometric framework is proposed for semiparametric nonlinear regression models based on the concept of least favorable curve, introduced by Severini and Wong (1992). The authors use this framework to drive three kin...A geometric framework is proposed for semiparametric nonlinear regression models based on the concept of least favorable curve, introduced by Severini and Wong (1992). The authors use this framework to drive three kinds of improved approximate confidence regions for the parameter and parameter subset in terms of curvatures. The results obtained by Hamilton et al. (1982), Hamilton (1986) and Wei (1994) are extended to semiparametric nonlinear regression models.展开更多
文摘The subset threshold auto regressive (SSTAR) model, which is capable of reproducing the limit cycle behavior of nonlinear time series, is introduced. The algorithm for fitting the sampled data with SSTAR model is proposed and applied to model and forecast power load. Numerical example verifies that desirable accuracy of short term load forecasting can be achieved by using the SSTAR model.
基金This work was performed under the auspices of the National Nuclear Security Administration of the US Department of Energy at Los Alamos National Laboratory under Contract No.89233218CNA000001The Authors gratefully acknowledge the support of the US Department of Energy National Nuclear Security Administration Advanced Simulation and Computing Program.LA-UR-22-33159.
文摘To accurately model flows with shock waves using staggered-grid Lagrangian hydrodynamics, the artificial viscosity has to be introduced to convert kinetic energy into internal energy, thereby increasing the entropy across shocks. Determining the appropriate strength of the artificial viscosity is an art and strongly depends on the particular problem and experience of the researcher. The objective of this study is to pose the problem of finding the appropriate strength of the artificial viscosity as an optimization problem and solve this problem using machine learning (ML) tools, specifically using surrogate models based on Gaussian Process regression (GPR) and Bayesian analysis. We describe the optimization method and discuss various practical details of its implementation. The shock-containing problems for which we apply this method all have been implemented in the LANL code FLAG (Burton in Connectivity structures and differencing techniques for staggered-grid free-Lagrange hydrodynamics, Tech. Rep. UCRL-JC-110555, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1992, in Consistent finite-volume discretization of hydrodynamic conservation laws for unstructured grids, Tech. Rep. CRL-JC-118788, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1994, Multidimensional discretization of conservation laws for unstructured polyhedral grids, Tech. Rep. UCRL-JC-118306, Lawrence Livermore National Laboratory, Livermore, CA, 1992, 1994, in FLAG, a multi-dimensional, multiple mesh, adaptive free-Lagrange, hydrodynamics code. In: NECDC, 1992). First, we apply ML to find optimal values to isolated shock problems of different strengths. Second, we apply ML to optimize the viscosity for a one-dimensional (1D) propagating detonation problem based on Zel’dovich-von Neumann-Doring (ZND) (Fickett and Davis in Detonation: theory and experiment. Dover books on physics. Dover Publications, Mineola, 2000) detonation theory using a reactive burn model. We compare results for default (currently used values in FLAG) and optimized values of the artificial viscosity for these problems demonstrating the potential for significant improvement in the accuracy of computations.
文摘Carbon emissions have become a critical concern in the global effort to combat climate change,with each country or region contributing differently based on its economic structures,energy sources,and industrial activities.The factors influencing carbon emissions vary across countries and sectors.This study examined the factors influencing CO_(2)emissions in the 7 South American countries including Argentina,Brazil,Chile,Colombia,Ecuador,Peru,and Venezuela.We used the Seemingly Unrelated Regression(SUR)model to analyse the relationship of CO_(2)emissions with gross domestic product(GDP),renewable energy use,urbanization,industrialization,international tourism,agricultural productivity,and forest area based on data from 2000 to 2022.According to the SUR model,we found that GDP and industrialization had a moderate positive effect on CO_(2)emissions,whereas renewable energy use had a moderate negative effect on CO_(2)emissions.International tourism generally had a positive impact on CO_(2)emissions,while forest area tended to decrease CO_(2)emissions.Different variables had different effects on CO_(2)emissions in the 7 South American countries.In Argentina and Venezuela,GDP,international tourism,and agricultural productivity significantly affected CO_(2)emissions.In Colombia,GDP and international tourism had a negative impact on CO_(2)emissions.In Brazil,CO_(2)emissions were primarily driven by GDP,while in Chile,Ecuador,and Peru,international tourism had a negative effect on CO_(2)emissions.Overall,this study highlights the importance of country-specific strategies for reducing CO_(2)emissions and emphasizes the varying roles of these driving factors in shaping environmental quality in the 7 South American countries.
文摘BACKGROUND Congenital heart disease is most commonly seen in neonates and it is a major cause of pediatric illness and childhood morbidity and mortality.AIM To identify and build the best predictive model for predicting cyanotic and acyanotic congenital heart disease in children during pregnancy and identify their potential risk factors.METHODS The data were collected from the Pediatric Cardiology Department at Chaudhry Pervaiz Elahi Institute of Cardiology Multan,Pakistan from December 2017 to October 2019.A sample of 3900 mothers whose children were diagnosed with identify the potential outliers.Different machine learning models were compared,and the best-fitted model was selected using the area under the curve,sensitivity,and specificity of the models.RESULTS Out of 3900 patients included,about 69.5%had acyanotic and 30.5%had cyanotic congenital heart disease.Males had more cases of acyanotic(53.6%)and cyanotic(54.5%)congenital heart disease as compared to females.The odds of having cyanotic was 1.28 times higher for children whose mothers used more fast food frequently during pregnancy.The artificial neural network model was selected as the best predictive model with an area under the curve of 0.9012,sensitivity of 65.76%,and specificity of 97.23%.CONCLUSION Children having a positive family history are at very high risk of having cyanotic and acyanotic congenital heart disease.Males are more at risk and their mothers need more care,good food,and physical activity during pregnancy.The best-fitted model for predicting cyanotic and acyanotic congenital heart disease is the artificial neural network.The results obtained and the best model identified will be useful for medical practitioners and public health scientists for an informed decision-making process about the earlier diagnosis and improve the health condition of children in Pakistan.
基金National Social Science Fund Project“Research on the Operational Risks and Prevention of Government Procurement of Community Services Project System”(Project No.21CSH018)Research and Application of SDM Cigarette Supply Strategy Based on Consumer Data Analysis(Project No.2023ASXM07)。
文摘This study aims to analyze and predict the relationship between the average price per box in the cigarette market of City A and government procurement,providing a scientific basis and support for decision-making.By reviewing relevant theories and literature,qualitative prediction methods,regression prediction models,and other related theories were explored.Through the analysis of annual cigarette sales data and government procurement data in City A,a comprehensive understanding of the development of the tobacco industry and the economic trends of tobacco companies in the county was obtained.By predicting and analyzing the average price per box of cigarette sales across different years,corresponding prediction results were derived and compared with actual sales data.The prediction results indicate that the correlation coefficient between the average price per box of cigarette sales and government procurement is 0.982,implying that government procurement accounts for 96.4%of the changes in the average price per box of cigarettes.These findings offer an in-depth exploration of the relationship between the average price per box of cigarettes in City A and government procurement,providing a scientific foundation for corporate decision-making and market operations.
文摘Municipal solid waste generation is strongly linked to rising human population and expanding urban areas, with significant implications on urban metabolism as well as space and place values redefinition. Effective management performance of municipal solid waste management underscores the interdisciplinarity strategies. Such knowledge and skills are paramount to uncover the sources of waste generation as well as means of waste storage, collection, recycling, transportation, handling/treatment, disposal, and monitoring. This study was conducted in Dar es Salaam city. Driven by the curiosity model of the solid waste minimization performance at source, study data was collected using focus group discussion techniques to ward-level local government officers, which was triangulated with literature and documentary review. The main themes of the FGD were situational factors (SFA) and local government by-laws (LGBY). In the FGD session, sub-themes of SFA tricked to understand how MSW minimization is related to the presence and effect of services such as land use planning, availability of landfills, solid waste transfer stations, material recovery facilities, incinerators, solid waste collection bins, solid waste trucks, solid waste management budget and solid waste collection agents. Similarly, FGD on LGBY was extended by sub-themes such as contents of the by-law, community awareness of the by-law, and by-law enforcement mechanisms. While data preparation applied an analytical hierarchy process, data analysis applied an ordinary least square (OLS) regression model for sub-criteria that explain SFA and LGBY;and OLS standard residues as variables into geographically weighted regression with a resolution of 241 × 241 meter in ArcMap v10.5. Results showed that situational factors and local government by-laws have a strong relationship with the rate of minimizing solid waste dumping in water bodies (local R square = 0.94).
文摘Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear model is the most used technique for identifying hidden relationships between underlying random variables of interest. However, data quality is a significant challenge in machine learning, especially when missing data is present. The linear regression model is a commonly used statistical modeling technique used in various applications to find relationships between variables of interest. When estimating linear regression parameters which are useful for things like future prediction and partial effects analysis of independent variables, maximum likelihood estimation (MLE) is the method of choice. However, many datasets contain missing observations, which can lead to costly and time-consuming data recovery. To address this issue, the expectation-maximization (EM) algorithm has been suggested as a solution for situations including missing data. The EM algorithm repeatedly finds the best estimates of parameters in statistical models that depend on variables or data that have not been observed. This is called maximum likelihood or maximum a posteriori (MAP). Using the present estimate as input, the expectation (E) step constructs a log-likelihood function. Finding the parameters that maximize the anticipated log-likelihood, as determined in the E step, is the job of the maximization (M) phase. This study looked at how well the EM algorithm worked on a made-up compositional dataset with missing observations. It used both the robust least square version and ordinary least square regression techniques. The efficacy of the EM algorithm was compared with two alternative imputation techniques, k-Nearest Neighbor (k-NN) and mean imputation (), in terms of Aitchison distances and covariance.
文摘Under-fitting problems usually occur in regression models for dam safety monitoring.To overcome the local convergence of the regression, a genetic algorithm (GA) was proposed using a real parameter coding, a ranking selection operator, an arithmetical crossover operator and a uniform mutation operator, and calculated the least-square error of the observed and computed values as its fitness function. The elitist strategy was used to improve the speed of the convergence. After that, the modified genetic algorithm was applied to reassess the coefficients of the regression model and a genetic regression model was set up. As an example, a slotted gravity dam in the Northeast of China was introduced. The computational results show that the genetic regression model can solve the under-fitting problems perfectly.
基金National Natural Science Foundation of China(No.51175480)
文摘Based on modeling principle of GM(1,1)model and linear regression model,a combined prediction model is established to predict equipment fault by the fitting of two models.The new prediction model takes full advantage of prediction information provided by the two models and improves the prediction precision.Finally,this model is introduced to predict the system fault time according to the output voltages of a certain type of radar transmitter.
基金The National Natural Science Foundation of China(No.51106025,51106027,51036002)Specialized Research Fund for the Doctoral Program of Higher Education(No.20130092110061)the Youth Foundation of Nanjing Institute of Technology(No.QKJA201303)
文摘A fuzzy observations-based radial basis function neural network (FORBFNN) is presented for modeling nonlinear systems in which the observations of response are imprecise but can be represented as fuzzy membership functions. In the FORBFNN model, the weight coefficients of nodes in the hidden layer are identified by using the fuzzy expectation-maximization ( EM ) algorithm, whereas the optimal number of these nodes as well as the centers and widths of radial basis functions are automatically constructed by using a data-driven method. Namely, the method starts with an initial node, and then a new node is added in a hidden layer according to some rules. This procedure is not terminated until the model meets the preset requirements. The method considers both the accuracy and complexity of the model. Numerical simulation results show that the modeling method is effective, and the established model has high prediction accuracy.
文摘The aim of this study was to assay the polyphenols,flavonoid,polyphenol oxidase and phenylalnine ammonialyase which were relative to the anthocyanins synthesis of purple corn. The optimization of multiple linear regression model of anthocyanins synthesis was y=4.383 86-0.205 45x1+5.479 638x2+0.195 575x4. According to standard partial regression coefficient testing,the result indicated that polyphenols content was negatively correlated with anthocyanins and the relative influence to anthocyanins synthesis was-42.7%; flavonoid content and activity of polyphenol oxidase were positively correlated with anthocyanins of purple corn and the relative influence to anthocyanins synthesis were 71.45% and 73.32% respectively. There was no positive correlation between the activity of phenylalnine ammonialyase and anthocyanins of purple corn. The establishment of multiple linear regression model of anthocyanins synthesis was to provide theory foundation of producing anthocyanins in laboratory.
基金supported by the National Natural Science Foundation of China,Nos.82071426,81873784Clinical Cohort Construction Program of Peking University Third Hospital,No.BYSYDL2019002(all to DF)。
文摘Amyotrophic lateral sclerosis is a rare neurodegenerative disease characterized by the involvement of both upper and lower motor neurons.Early bilateral limb involvement significantly affects patients'daily lives and may lead them to be confined to bed.However,the effect of upper and lower motor neuron impairment and other risk factors on bilateral limb involvement is unclear.To address this issue,we retrospectively collected data from 586 amyotrophic lateral sclerosis patients with limb onset diagnosed at Peking University Third Hospital between January 2020 and May 2022.A univariate analysis revealed no significant differences in the time intervals of spread in different directions between individuals with upper motor neuron-dominant amyotrophic lateral sclerosis and those with classic amyotrophic lateral sclerosis.We used causal directed acyclic graphs for risk factor determination and Cox proportional hazards models to investigate the association between the duration of bilateral limb involvement and clinical baseline characteristics in amyotrophic lateral sclerosis patients.Multiple factor analyses revealed that higher upper motor neuron scores(hazard ratio[HR]=1.05,95%confidence interval[CI]=1.01–1.09,P=0.018),onset in the left limb(HR=0.72,95%CI=0.58–0.89,P=0.002),and a horizontal pattern of progression(HR=0.46,95%CI=0.37–0.58,P<0.001)were risk factors for a shorter interval until bilateral limb involvement.The results demonstrated that a greater degree of upper motor neuron involvement might cause contralateral limb involvement to progress more quickly in limb-onset amyotrophic lateral sclerosis patients.These findings may improve the management of amyotrophic lateral sclerosis patients with limb onset and the prediction of patient prognosis.
基金The National Natural Science Foundation of China(No.11171065)the Natural Science Foundation of Jiangsu Province(No.BK2011058)
文摘In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood estimation is considered. Three diagnostic statistics are used to detect whether the outliers exist in the data set. Simulation results show that when the sample size is small, the values of diagnostic statistics based on the maximum Lq-likelihood estimation are greater than the values based on the maximum likelihood estimation. As the sample size increases, the difference between the values of the diagnostic statistics based on two estimation methods diminishes gradually. It means that the outliers can be distinguished easier through the maximum Lq-likelihood method than those through the maximum likelihood estimation method.
基金Supported by the Natural Science Foundation of Anhui Education Committee
文摘In this paper, based on the theory of parameter estimation, we give a selection method and, in a sense of a good character of the parameter estimation, we think that it is very reasonable. Moreover, we offer a calculation method of selection statistic and an applied example.
基金Project supported by the National Natural Science Foundation of China (No.40571115)the National High Tech-nology Research and Development Program (863 Program) of China (Nos.2006AA120101 and 2007AA10Z205)
文摘The radial basis function (RBF) emerged as a variant of artificial neural network. Generalized regression neural network (GRNN) is one type of RBF, and its principal advantages are that it can quickly learn and rapidly converge to the optimal regression surface with large number of data sets. Hyperspectral reflectance (350 to 2500 nm) data were recorded at two different rice sites in two experiment fields with two cultivars, three nitrogen treatments and one plant density (45 plants m^-2). Stepwise multivariable regression model (SMR) and RBF were used to compare their predictability for the leaf area index (LAI) and green leaf chlorophyll density (GLCD) of rice based on reflectance (R) and its three different transformations, the first derivative reflectance (D1), the second derivative reflectance (D2) and the log-transformed reflectance (LOG). GRNN based on D1 was the best model for the prediction of rice LAI and CLCD. The relationships between different transformations of reflectance and rice parameters could be further improved when RBF was employed. Owing to its strong capacity for nonlinear mapping and good robustness, GRNN could maximize the sensitivity to chlorophyll content using D1. It is concluded that RBF may provide a useful exploratory and predictive tool for the estimation of rice biophysical parameters.
基金funded by National Key Research and Development Program of China, Ecological Safety Guarantee Technology and Demonstration Channel and Slope Treatment Project in Loess Hilly and Gully Area (Grant No. 2017YFC0504700)。
文摘Landslide susceptibility mapping is significant for landslide prevention.Many approaches have been used for landslide susceptibility prediction,however,their performances are unstable.This study constructed a hybrid model,namely box counting dimension-based kernel logistic regression model,which uses fractal dimension calculated by box counting method as input data based on grid cells mapping unit and terrain mapping unit.The performance of this model was evaluated in the application in Zhidan County,Shaanxi Province,China.Firstly,a total of 221 landslides were identified and mapped,and 11 landslide predisposing factors were considered.Secondly,the landslide susceptibility maps(LSMs) of the study area were obtained by constructing the model on two different mapping units.Finally,the results were evaluated with five statistical indexes,sensitivity,specificity,positive predictive value(PPV),negative predictive value(NPV) and Accuracy.The statistical indexes of the model obtained on the terrain mapping unit were larger than those based on grid cells mapping unit.For training and validation datasets,the area under the receiver operating characteristic curve(AUC) of the model based on terrain mapping unit were 0.9374 and 0.9527,respectively,indicating that establishing this model on the terrain mapping unit was advantageous in the study area.The results show that the fractal dimension improves the prediction ability of the kernel logistic model.In addition,the terrain mapping unit is a more promising mapping unit in Loess areas.
基金Project supported by the National Natural Science Foundation of China (Grant No 60573065)the Natural Science Foundation of Shandong Province,China (Grant No Y2007G33)the Key Subject Research Foundation of Shandong Province,China(Grant No XTD0708)
文摘In this paper we apply the nonlinear time series analysis method to small-time scale traffic measurement data. The prediction-based method is used to determine the embedding dimension of the traffic data. Based on the reconstructed phase space, the local support vector machine prediction method is used to predict the traffic measurement data, and the BIC-based neighbouring point selection method is used to choose the number of the nearest neighbouring points for the local support vector machine regression model. The experimental results show that the local support vector machine prediction method whose neighbouring points are optimized can effectively predict the small-time scale traffic measurement data and can reproduce the statistical features of real traffic measurements.
基金funded by the National Natural Science Foundation of China(32072764, 31702121)the 2115 Talent Development Program of China Agricultural UniversityNational Key Research and Development Program of China (2019YFD1002605)
文摘Backgrounds:Evaluating the growth performance of pigs in real-time is laborious and expensive,thus mathematical models based on easily accessible variables are developed.Multiple regression(MR)is the most widely used tool to build prediction models in swine nutrition,while the artificial neural networks(ANN)model is reported to be more accurate than MR model in prediction performance.Therefore,the potential of ANN models in predicting the growth performance of pigs was evaluated and compared with MR models in this study.Results:Body weight(BW),net energy(NE)intake,standardized ileal digestible lysine(SID Lys)intake,and their quadratic terms were selected as input variables to predict ADG and F/G among 10 candidate variables.In the training phase,MR models showed high accuracy in both ADG and F/G prediction(R^(2)_(ADG)=0.929,R^(2)_(F/G)=0.886)while ANN models with 4,6 neurons and radial basis activation function yielded the best performance in ADG and F/G prediction(R^(2)_(ADG)=0.964,R^(2)_(F/G)=0.932).In the testing phase,these ANN models showed better accuracy in ADG prediction(CCC:0.976 vs.0.861,R^(2):0.951 vs.0.584),and F/G prediction(CCC:0.952 vs.0.900,R^(2):0.905 vs.0.821)compared with the MR models.Meanwhile,the“over-fitting”occurred in MR models but not in ANN models.On validation data from the animal trial,ANN models exhibited superiority over MR models in both ADG and F/G prediction(P<0.01).Moreover,the growth stages have a significant effect on the prediction accuracy of the models.Conclusion:Body weight,NE intake and SID Lys intake can be used as input variables to predict the growth performance of growing-finishing pigs,with trained ANN models are more flexible and accurate than MR models.Therefore,it is promising to use ANN models in related swine nutrition studies in the future.
基金Funding from The Scientific and Technological Research Council of Turkey(Project No:2130026)is gratefully acknowledged
文摘Background: Leaf Area Index(LAI) is an important parameter used in monitoring and modeling of forest ecosystems. The aim of this study was to evaluate performance of the artificial neural network(ANN) models to predict the LAI by comparing the regression analysis models as the classical method in these pure and even-aged Crimean pine forest stands.Methods: One hundred eight temporary sample plots were collected from Crimean pine forest stands to estimate stand parameters. Each sample plot was imaged with hemispherical photographs to detect the LAI. The partial correlation analysis was used to assess the relationships between the stand LAI values and stand parameters, and the multivariate linear regression analysis was used to predict the LAI from stand parameters. Different artificial neural network models comprising different number of neuron and transfer functions were trained and used to predict the LAI of forest stands.Results: The correlation coefficients between LAI and stand parameters(stand number of trees, basal area, the quadratic mean diameter, stand density and stand age) were significant at the level of 0.01. The stand age, number of trees, site index, and basal area were independent parameters in the most successful regression model predicted LAI values using stand parameters(R_(adj)~2=0.5431). As corresponding method to predict the interactions between the stand LAI values and stand parameters, the neural network architecture based on the RBF 4-19-1 with Gaussian activation function in hidden layer and the identity activation function in output layer performed better in predicting LAI(SSE(12.1040), MSE(0.1223), RMSE(0.3497), AIC(0.1040), BIC(-77.7310) and R^2(0.6392)) compared to the other studied techniques.Conclusion: The ANN outperformed the multivariate regression techniques in predicting LAI from stand parameters. The ANN models, developed in this study, may aid in making forest management planning in study forest stands.
文摘A geometric framework is proposed for semiparametric nonlinear regression models based on the concept of least favorable curve, introduced by Severini and Wong (1992). The authors use this framework to drive three kinds of improved approximate confidence regions for the parameter and parameter subset in terms of curvatures. The results obtained by Hamilton et al. (1982), Hamilton (1986) and Wei (1994) are extended to semiparametric nonlinear regression models.