Many properties of fruit are influenced by plant nutrition. Fruit firmness is one of the most important fruit characteristics and determines post-harvest life of the fruit, in recent decades, artificial intelligence s...Many properties of fruit are influenced by plant nutrition. Fruit firmness is one of the most important fruit characteristics and determines post-harvest life of the fruit, in recent decades, artificial intelligence systems were employed for developing predictive models to estimate and predict many agriculture processes. In the present study, the predictive capabilities of multiple linear regressions (MLR) and artificial neural networks (ANNs) are evaluated to estimate fruit firmness in six months, including each of nutrients concentrations (nitrogen (N), potassium (K), calcium (Ca) and magnesium (Mg)) alone (P1), com- bination of nutrients concentrations (P2), nutrient concentration ratios alone (P3), and combination of nutrient concentrations and nutrient concentration ratios (P4). The results showed that MLR model estimated fruit firmness more accuracy than ANN model in three datasets (P1, P2 and P4). However, the application of P3 (N/Ca ratio) as the input dataset in ANN model improved the prediction of fruit firmness than the MLR model. Correlation coefficient and root mean squared error (RMSE) were 0.850 and 0.539 between the measured and the estimated data by the ANN model, respectively. Generally, the ANN model showed greater potential in determining the relationship between 6-mon-fruit firmness and nutrients concentration.展开更多
Recognition method of traffic flow change point was put forward based on traffic flow theory and the statistical change point analysis of multiple linear regressions. The method was calibrated and tested with the fiel...Recognition method of traffic flow change point was put forward based on traffic flow theory and the statistical change point analysis of multiple linear regressions. The method was calibrated and tested with the field data of Liantong Road of Zibo city to verify the validity and the feasibility of the theory. The results show that change point method of multiple linear regression can make out the rule of quantitative changes in traffic flow more accurately than ordinary methods. So, the change point method can be applied to traffic information management system more effectively.展开更多
Cloud infrastructural resource optimization is the process of precisely selecting the allocating the correct resources either to a workload or application.When workload execution,accuracy,and cost are accurately stabi...Cloud infrastructural resource optimization is the process of precisely selecting the allocating the correct resources either to a workload or application.When workload execution,accuracy,and cost are accurately stabilized in opposition to the best possible framework in real-time,efficiency is attained.In addition,every workload or application required for the framework is characteristic and these essentials change over time.But,the existing method was failed to ensure the high Quality of Service(QoS).In order to address this issue,a Tricube Weighted Linear Regression-based Inter Quartile(TWLR-IQ)for Cloud Infrastructural Resource Optimization is introduced.A Tricube Weighted Linear Regression is presented in the proposed method to estimate the resources(i.e.,CPU,RAM,and network bandwidth utilization)based on the usage history in each cloud server.Then,Inter Quartile Range is applied to efficiently predict the overload hosts for ensuring a smooth migration.Experimental results show that our proposed method is better than the approach in Cloudsim under various performance metrics.The results clearly showed that the proposed method can reduce the energy consumption and provide a high level of commitment with ensuring the minimum number of Virtual Machine(VM)Migrations as compared to the state-of-the-art methods.展开更多
Aiming at the problems of low accuracy,long time consumption,and failure to obtain quantita-tive fault identification results of existing automatic fault identification technic,a fault recognition method based on clus...Aiming at the problems of low accuracy,long time consumption,and failure to obtain quantita-tive fault identification results of existing automatic fault identification technic,a fault recognition method based on clustering linear regression is proposed.Firstly,Hough transform is used to detect the line segment of the enhanced image obtained by the coherence cube algorithm.Secondly,the endpoint of the line segment detected by Hough transform is taken as the key point,and the adaptive clustering linear regression algorithm is used to cluster the key points adaptively according to the lin-ear relationship between them.Finally,a fault is generated from each category of key points based on least squares curve fitting method to realize fault identification.To verify the feasibility and pro-gressiveness of the proposed method,it is compared with the traditional method and the latest meth-od on the actual seismic data through experiments,and the effectiveness of the proposed method is verified by the experimental results on the actual seismic data.展开更多
A Mobile Ad-hoc NETwork(MANET)contains numerous mobile nodes,and it forms a structure-less network associated with wireless links.But,the node movement is the key feature of MANETs;hence,the quick action of the nodes ...A Mobile Ad-hoc NETwork(MANET)contains numerous mobile nodes,and it forms a structure-less network associated with wireless links.But,the node movement is the key feature of MANETs;hence,the quick action of the nodes guides a link failure.This link failure creates more data packet drops that can cause a long time delay.As a result,measuring accurate link failure time is the key factor in the MANET.This paper presents a Fuzzy Linear Regression Method to measure Link Failure(FLRLF)and provide an optimal route in the MANET-Internet of Things(IoT).This work aims to predict link failure and improve routing efficiency in MANET.The Fuzzy Linear Regression Method(FLRM)measures the long lifespan link based on the link failure.The mobile node group is built by the Received Signal Strength(RSS).The Hill Climbing(HC)method selects the Group Leader(GL)based on node mobility,node degree and node energy.Additionally,it uses a Data Gathering node forward the infor-mation from GL to the sink node through multiple GL.The GL is identified by linking lifespan and energy using the Particle Swarm Optimization(PSO)algo-rithm.The simulation results demonstrate that the FLRLF approach increases the GL lifespan and minimizes the link failure time in the MANET.展开更多
The development of prediction supports is a critical step in information systems engineering in this era defined by the knowledge economy, the hub of which is big data. Currently, the lack of a predictive model, wheth...The development of prediction supports is a critical step in information systems engineering in this era defined by the knowledge economy, the hub of which is big data. Currently, the lack of a predictive model, whether qualitative or quantitative, depending on a company’s areas of intervention can handicap or weaken its competitive capacities, endangering its survival. In terms of quantitative prediction, depending on the efficacy criteria, a variety of methods and/or tools are available. The multiple linear regression method is one of the methods used for this purpose. A linear regression model is a regression model of an explained variable on one or more explanatory variables in which the function that links the explanatory variables to the explained variable has linear parameters. The purpose of this work is to demonstrate how to use multiple linear regressions, which is one aspect of decisional mathematics. The use of multiple linear regressions on random data, which can be replaced by real data collected by or from organizations, provides decision makers with reliable data knowledge. As a result, machine learning methods can provide decision makers with relevant and trustworthy data. The main goal of this article is therefore to define the objective function on which the influencing factors for its optimization will be defined using the linear regression method.展开更多
Social network is the mainstream medium of current information dissemination,and it is particularly important to accurately predict its propagation law.In this paper,we introduce a social network propagation model int...Social network is the mainstream medium of current information dissemination,and it is particularly important to accurately predict its propagation law.In this paper,we introduce a social network propagation model integrating multiple linear regression and infectious disease model.Firstly,we proposed the features that affect social network communication from three dimensions.Then,we predicted the node influence via multiple linear regression.Lastly,we used the node influence as the state transition of the infectious disease model to predict the trend of information dissemination in social networks.The experimental results on a real social network dataset showed that the prediction results of the model are consistent with the actual information dissemination trends.展开更多
The rock matrix bulk modulus or its inverse, the compressive coefficient, is an important input parameter for fluid substitution by the Biot-Gassmann equation in reservoir prediction. However, it is not easy to accura...The rock matrix bulk modulus or its inverse, the compressive coefficient, is an important input parameter for fluid substitution by the Biot-Gassmann equation in reservoir prediction. However, it is not easy to accurately estimate the bulk modulus by using conventional methods. In this paper, we present a new linear regression equation for calculating the parameter. In order to get this equation, we first derive a simplified Gassmann equation by using a reasonable assumption in which the compressive coefficient of the saturated pore fluid is much greater than the rock matrix, and, second, we use the Eshelby- Walsh relation to replace the equivalent modulus of a dry rock in the Gassmann equation. Results from the rock physics analysis of rock sample from a carbonate area show that rock matrix compressive coefficients calculated with water-saturated and dry rock samples using the linear regression method are very close (their error is less than 1%). This means the new method is accurate and reliable.展开更多
In the process of shield tunneling through soft soil layers,the presence of confined water ahead poses a significant threat to the stability of the tunnel face.Therefore,it is crucial to consider the impact of confine...In the process of shield tunneling through soft soil layers,the presence of confined water ahead poses a significant threat to the stability of the tunnel face.Therefore,it is crucial to consider the impact of confined water on the limit support pressure of the tunnel face.This study employed the finite element method(FEM)to analyze the limit support pressure of shield tunnel face instability within a pressurized water-containing layer.Subsequently,a multiple linear regression approach was applied to derive a concise solution formula for the limit support pressure,incorporating various influencing factors.The analysis yields the following conclusions:1)The influence of confined water on the instability mode of the tunnel face in soft soil layers makes the displacement response of the strata not significant when the face is unstable;2)The limit support pressure increases approximately linearly with the pressure head,shield tunnel diameter,and tunnel burial depth.And inversely proportional to the thickness of the impermeable layer,soil cohesion and internal friction angle;3)Through an engineering case study analysis,the results align well with those obtained from traditional theoretical methods,thereby validating the rationality of the equations proposed in this paper.Furthermore,the proposed equations overcome the limitation of traditional theoretical approaches considering the influence of changes in impermeable layer thickness.It can accurately depict the dynamic variation in the required limit support pressure to maintain the stability of the tunnel face during shield tunneling,thus better reflecting engineering reality.展开更多
Accurate software cost estimation in Global Software Development(GSD)remains challenging due to reliance on historical data and expert judgments.Traditional models,such as the Constructive Cost Model(COCOMO II),rely h...Accurate software cost estimation in Global Software Development(GSD)remains challenging due to reliance on historical data and expert judgments.Traditional models,such as the Constructive Cost Model(COCOMO II),rely heavily on historical and accurate data.In addition,expert judgment is required to set many input parameters,which can introduce subjectivity and variability in the estimation process.Consequently,there is a need to improve the current GSD models to mitigate reliance on historical data,subjectivity in expert judgment,inadequate consideration of GSD-based cost drivers and limited integration of modern technologies with cost overruns.This study introduces a novel hybrid model that synergizes the COCOMO II with Artificial Neural Networks(ANN)to address these challenges.The proposed hybrid model integrates additional GSD-based cost drivers identified through a systematic literature review and further vetted by industry experts.This article compares the effectiveness of the proposedmodelwith state-of-the-artmachine learning-basedmodels for software cost estimation.Evaluating the NASA 93 dataset by adopting twenty-six GSD-based cost drivers reveals that our hybrid model achieves superior accuracy,outperforming existing state-of-the-artmodels.The findings indicate the potential of combining COCOMO II,ANN,and additional GSD-based cost drivers to transform cost estimation in GSD.展开更多
In oil and gas exploration,elucidating the complex interdependencies among geological variables is paramount.Our study introduces the application of sophisticated regression analysis method at the forefront,aiming not...In oil and gas exploration,elucidating the complex interdependencies among geological variables is paramount.Our study introduces the application of sophisticated regression analysis method at the forefront,aiming not just at predicting geophysical logging curve values but also innovatively mitigate hydrocarbon depletion observed in geochemical logging.Through a rigorous assessment,we explore the efficacy of eight regression models,bifurcated into linear and nonlinear groups,to accommodate the multifaceted nature of geological datasets.Our linear model suite encompasses the Standard Equation,Ridge Regression,Least Absolute Shrinkage and Selection Operator,and Elastic Net,each presenting distinct advantages.The Standard Equation serves as a foundational benchmark,whereas Ridge Regression implements penalty terms to counteract overfitting,thus bolstering model robustness in the presence of multicollinearity.The Least Absolute Shrinkage and Selection Operator for variable selection functions to streamline models,enhancing their interpretability,while Elastic Net amalgamates the merits of Ridge Regression and Least Absolute Shrinkage and Selection Operator,offering a harmonized solution to model complexity and comprehensibility.On the nonlinear front,Gradient Descent,Kernel Ridge Regression,Support Vector Regression,and Piecewise Function-Fitting methods introduce innovative approaches.Gradient Descent assures computational efficiency in optimizing solutions,Kernel Ridge Regression leverages the kernel trick to navigate nonlinear patterns,and Support Vector Regression is proficient in forecasting extremities,pivotal for exploration risk assessment.The Piecewise Function-Fitting approach,tailored for geological data,facilitates adaptable modeling of variable interrelations,accommodating abrupt data trend shifts.Our analysis identifies Ridge Regression,particularly when augmented by Piecewise Function-Fitting,as superior in recouping hydrocarbon losses,and underscoring its utility in resource quantification refinement.Meanwhile,Kernel Ridge Regression emerges as a noteworthy strategy in ameliorating porosity-logging curve prediction for well A,evidencing its aptness for intricate geological structures.This research attests to the scientific ascendancy and broad-spectrum relevance of these regression techniques over conventional methods while heralding new horizons for their deployment in the oil and gas sector.The insights garnered from these advanced modeling strategies are set to transform geological and engineering practices in hydrocarbon prediction,evaluation,and recovery.展开更多
Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have dev...Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have developed from the logistic regression model,the geographical weighted logistic regression model,the Lasso regression model,the random forest model,and the support vector machine model based on historical forest fire data from 2000 to 2019 in Jilin Province.The models,along with a distribution map are presented in this paper to provide a theoretical basis for forest fire management in this area.Existing studies show that the prediction accuracies of the two machine learning models are higher than those of the three generalized linear regression models.The accuracies of the random forest model,the support vector machine model,geographical weighted logistic regression model,the Lasso regression model,and logistic model were 88.7%,87.7%,86.0%,85.0%and 84.6%,respectively.Weather is the main factor affecting forest fires,while the impacts of topography factors,human and social-economic factors on fire occurrence were similar.展开更多
Gauge length influences the biomechanical properties of herbaceous roots such as tensile resistance,tensile strength and Young’s modulus.However,the extent to which and how these biomechanical properties of herbaceou...Gauge length influences the biomechanical properties of herbaceous roots such as tensile resistance,tensile strength and Young’s modulus.However,the extent to which and how these biomechanical properties of herbaceous roots are influenced remain unknown.To better understand the behavior of roots in tension under different conditions and to illustrate these behaviors,uniaxial tensile tests were conducted on the Poa araratica roots as the gauge length increased from 20 mm to 80 mm.Subsequently,ANOVA was used to test the impact of the significant influences of gauge length on the biomechanical properties,nonlinear regression was applied to establish the variation in the biomechanical properties with gauge length to answer the question of the extent to which the biomechanical properties are influenced,and Weibull models were subsequently introduced to illustrate how the biomechanical properties are influenced by gauge length.The results reveal that(1)the variation in biomechanical properties with root diameter depends on both the gauge length and the properties themselves;(2)the gauge length significantly impacts most of the biomechanical properties;(3)the tensile resistance,tensile strength,and tensile strain at cracks decrease as the gauge length increases,with values decreasing by 20%-300%,while Young’s modulus exhibits the opposite trend,with a corresponding increase of 30%;and(4)the Weibull distribution is suitable for describing the probability distribution of these biomechanical properties;the Weibull modulus for both tensile resistance and tensile strain at cracks linearly decrease with gauge length,whereas those for tensile strength and Young’s modulus exhibit the opposite trend.The tensile resistance,tensile strength,and tensile strain at the cracks linearly decrease with increasing gauge length,while the tensile strength and Young’s modulus linearly increase with increasing gauge length.展开更多
The change processes and trends of shoreline and tidal flat forced by human activities are essential issues for the sustainability of coastal area,which is also of great significance for understanding coastal ecologic...The change processes and trends of shoreline and tidal flat forced by human activities are essential issues for the sustainability of coastal area,which is also of great significance for understanding coastal ecological environment changes and even global changes.Based on field measurements,combined with Linear Regression(LR)model and Inverse Distance Weighing(IDW)method,this paper presents detailed analysis on the change history and trend of the shoreline and tidal flat in Bohai Bay.The shoreline faces a high erosion chance under the action of natural factors,while the tidal flat faces a different erosion and deposition patterns in Bohai Bay due to the impact of human activities.The implication of change rule for ecological protection and recovery is also discussed.Measures should be taken to protect the coastal ecological environment.The models used in this paper show a high correlation coefficient between observed and modeling data,which means that this method can be used to predict the changing trend of shoreline and tidal flat.The research results of present study can provide scientific supports for future coastal protection and management.展开更多
Based on China Family Panel Studies(CFPS)2018 data,the multiple linear regression model is used to analyze the effects of Internet use on women’s depression,and to test the robustness of the regression results.At the...Based on China Family Panel Studies(CFPS)2018 data,the multiple linear regression model is used to analyze the effects of Internet use on women’s depression,and to test the robustness of the regression results.At the same time,the effects of Internet use on mental health of women with different residence,age,marital status and physical health status are analyzed.Then,we can obtain that Internet use has a significant promoting effect on women’s mental health,while the degree of Internet use has a significant inhibitory effect on women’s mental health.In addition,the study found that women’s age,education,place of residence,marital status,length of sleep,working status and physical health status are the main factors affecting the mental health of Chinese women.In the heterogeneity investigation of residence,age,marital status and physical health status,Internet use has a greater negative impact on the Center for Epidemiological Studies Depression Scale(CES-D8)scores of women in rural areas,has a significant positive impact on the mental health of middle-aged and elderly women or women with spouses,and has a positive impact on the mental health of physically unhealthy women.Therefore,in view of women’s mental health needs and the problems existing in the use of the Internet,this paper puts forward some suggestions to further improve the overall mental health level of women.展开更多
The diameter distribution function(DDF)is a crucial tool for accurately predicting stand carbon storage(CS).The current key issue,however,is how to construct a high-precision DDF based on stand factors,site quality,an...The diameter distribution function(DDF)is a crucial tool for accurately predicting stand carbon storage(CS).The current key issue,however,is how to construct a high-precision DDF based on stand factors,site quality,and aridity index to predict stand CS in multi-species mixed forests with complex structures.This study used data from70 survey plots for mixed broadleaf Populus davidiana and Betula platyphylla forests in the Mulan Rangeland State Forest,Hebei Province,China,to construct the DDF based on maximum likelihood estimation and finite mixture model(FMM).Ordinary least squares(OLS),linear seemingly unrelated regression(LSUR),and back propagation neural network(BPNN)were used to investigate the influences of stand factors,site quality,and aridity index on the shape and scale parameters of DDF and predicted stand CS of mixed broadleaf forests.The results showed that FMM accurately described the stand-level diameter distribution of the mixed P.davidiana and B.platyphylla forests;whereas the Weibull function constructed by MLE was more accurate in describing species-level diameter distribution.The combined variable of quadratic mean diameter(Dq),stand basal area(BA),and site quality improved the accuracy of the shape parameter models of FMM;the combined variable of Dq,BA,and De Martonne aridity index improved the accuracy of the scale parameter models.Compared to OLS and LSUR,the BPNN had higher accuracy in the re-parameterization process of FMM.OLS,LSUR,and BPNN overestimated the CS of P.davidiana but underestimated the CS of B.platyphylla in the large diameter classes(DBH≥18 cm).BPNN accurately estimated stand-and species-level CS,but it was more suitable for estimating stand-level CS compared to species-level CS,thereby providing a scientific basis for the optimization of stand structure and assessment of carbon sequestration capacity in mixed broadleaf forests.展开更多
Climate change has been a matter of discourse for the last several decades. Much research has been conducted regarding the causes and impacts of climate change around the world. The current research contributes to the...Climate change has been a matter of discourse for the last several decades. Much research has been conducted regarding the causes and impacts of climate change around the world. The current research contributes to the knowledge of the influence of climate change on our environment, with emphasis on earthquake occurrences in the region of Indonesia. Using global temperature anomaly as a measure of climate change, and earthquake data in Indonesia for the period 1900-2022, the paper seeks to find a relationship (if any) between the two variables. Statistical methods used include normal distribution analysis, linear regression and correlation test. The results show peculiar patterns in the progression of earthquake occurrences as well as global temperature anomaly occurring in the same time periods. The findings also indicated that the magnitudes of earthquakes remained unaffected by global temperature anomalies over the years. Nonetheless, there appears to be a potential correlation between temperature anomalies and the frequency of earthquake occurrences. As per the results, an increase in temperature anomaly is associated with a higher frequency of earthquakes.展开更多
The embracing of ICTs and related technologies has enhanced different approaches for governments worldwide to deliver services to their citizens in a smart way. However, the usage of e-government services by common ci...The embracing of ICTs and related technologies has enhanced different approaches for governments worldwide to deliver services to their citizens in a smart way. However, the usage of e-government services by common citizens is recognized as one of the major setbacks of e-government development in both developed and developing countries. Moreover, government agencies in these countries are facing great challenges in keeping the citizens motivated enough to continue to use e-government services. This research aims to investigate the factors that influence citizens’ trust towards continue use of e-government services in Cameroon. The proposed research model consisted of three main constructs including technological, governmental, risk factors as well as six demographic characteristics (age, gender, educational level, income, internet experience and cultural perception). A five-point Likert scale questionnaire was designed to collect data physically and electronically, 352 valid questionnaires were retrieved. Simple and Multiple regression analysis methods were applied to build an adequate model based on the verification of hypotheses proposed. Based on results obtained, four demographic characteristics (age, education, occupation and income) have influence on citizens’ trust in e-government meanwhile gender and cultural affiliation have no influence. Furthermore, technological factors and governmental factors positively influence trust level in e-government, whereas risk factors have a negative influence on trust level. Deducing from the results, a list of recommendations is proposed to the government of Cameroon in order to reinforce citizens’ trust in e-government services.展开更多
Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear mode...Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear model is the most used technique for identifying hidden relationships between underlying random variables of interest. However, data quality is a significant challenge in machine learning, especially when missing data is present. The linear regression model is a commonly used statistical modeling technique used in various applications to find relationships between variables of interest. When estimating linear regression parameters which are useful for things like future prediction and partial effects analysis of independent variables, maximum likelihood estimation (MLE) is the method of choice. However, many datasets contain missing observations, which can lead to costly and time-consuming data recovery. To address this issue, the expectation-maximization (EM) algorithm has been suggested as a solution for situations including missing data. The EM algorithm repeatedly finds the best estimates of parameters in statistical models that depend on variables or data that have not been observed. This is called maximum likelihood or maximum a posteriori (MAP). Using the present estimate as input, the expectation (E) step constructs a log-likelihood function. Finding the parameters that maximize the anticipated log-likelihood, as determined in the E step, is the job of the maximization (M) phase. This study looked at how well the EM algorithm worked on a made-up compositional dataset with missing observations. It used both the robust least square version and ordinary least square regression techniques. The efficacy of the EM algorithm was compared with two alternative imputation techniques, k-Nearest Neighbor (k-NN) and mean imputation (), in terms of Aitchison distances and covariance.展开更多
In basketball, each player’s skill level is the key to a team’s success or failure, the skill level is affected by many personal and environmental factors. A physics-informed AI statistics has become extremely impor...In basketball, each player’s skill level is the key to a team’s success or failure, the skill level is affected by many personal and environmental factors. A physics-informed AI statistics has become extremely important. In this article, a complex non-linear process is considered by taking into account the average points per game of each player, playing time, shooting percentage, and others. This physics-informed statistics is to construct a multiple linear regression model with physics-informed neural networks. Based on the official data provided by the American Basketball League, and combined with specific methods of R program analysis, the regression model affecting the player’s average points per game is verified, and the key factors affecting the player’s average points per game are finally elucidated. The paper provides a novel window for coaches to make meaningful in-game adjustments to team members.展开更多
文摘Many properties of fruit are influenced by plant nutrition. Fruit firmness is one of the most important fruit characteristics and determines post-harvest life of the fruit, in recent decades, artificial intelligence systems were employed for developing predictive models to estimate and predict many agriculture processes. In the present study, the predictive capabilities of multiple linear regressions (MLR) and artificial neural networks (ANNs) are evaluated to estimate fruit firmness in six months, including each of nutrients concentrations (nitrogen (N), potassium (K), calcium (Ca) and magnesium (Mg)) alone (P1), com- bination of nutrients concentrations (P2), nutrient concentration ratios alone (P3), and combination of nutrient concentrations and nutrient concentration ratios (P4). The results showed that MLR model estimated fruit firmness more accuracy than ANN model in three datasets (P1, P2 and P4). However, the application of P3 (N/Ca ratio) as the input dataset in ANN model improved the prediction of fruit firmness than the MLR model. Correlation coefficient and root mean squared error (RMSE) were 0.850 and 0.539 between the measured and the estimated data by the ANN model, respectively. Generally, the ANN model showed greater potential in determining the relationship between 6-mon-fruit firmness and nutrients concentration.
基金National Natural Science Foundations of China(No. 61074140,No. 60974094)Young Teacher Development Support Project of Shandong University of Technology,China
文摘Recognition method of traffic flow change point was put forward based on traffic flow theory and the statistical change point analysis of multiple linear regressions. The method was calibrated and tested with the field data of Liantong Road of Zibo city to verify the validity and the feasibility of the theory. The results show that change point method of multiple linear regression can make out the rule of quantitative changes in traffic flow more accurately than ordinary methods. So, the change point method can be applied to traffic information management system more effectively.
文摘Cloud infrastructural resource optimization is the process of precisely selecting the allocating the correct resources either to a workload or application.When workload execution,accuracy,and cost are accurately stabilized in opposition to the best possible framework in real-time,efficiency is attained.In addition,every workload or application required for the framework is characteristic and these essentials change over time.But,the existing method was failed to ensure the high Quality of Service(QoS).In order to address this issue,a Tricube Weighted Linear Regression-based Inter Quartile(TWLR-IQ)for Cloud Infrastructural Resource Optimization is introduced.A Tricube Weighted Linear Regression is presented in the proposed method to estimate the resources(i.e.,CPU,RAM,and network bandwidth utilization)based on the usage history in each cloud server.Then,Inter Quartile Range is applied to efficiently predict the overload hosts for ensuring a smooth migration.Experimental results show that our proposed method is better than the approach in Cloudsim under various performance metrics.The results clearly showed that the proposed method can reduce the energy consumption and provide a high level of commitment with ensuring the minimum number of Virtual Machine(VM)Migrations as compared to the state-of-the-art methods.
基金the National Natural Science Foundation of China(No.41804135)the Key Laboratory of Petroleum Resources Research,Institute of Geology and Geophysics,Chinese Academy of Sciences,Open Project(No.KLOR2018-9)the Beijing Information Science and Technology University Research Fund Project(No.2025025).
文摘Aiming at the problems of low accuracy,long time consumption,and failure to obtain quantita-tive fault identification results of existing automatic fault identification technic,a fault recognition method based on clustering linear regression is proposed.Firstly,Hough transform is used to detect the line segment of the enhanced image obtained by the coherence cube algorithm.Secondly,the endpoint of the line segment detected by Hough transform is taken as the key point,and the adaptive clustering linear regression algorithm is used to cluster the key points adaptively according to the lin-ear relationship between them.Finally,a fault is generated from each category of key points based on least squares curve fitting method to realize fault identification.To verify the feasibility and pro-gressiveness of the proposed method,it is compared with the traditional method and the latest meth-od on the actual seismic data through experiments,and the effectiveness of the proposed method is verified by the experimental results on the actual seismic data.
文摘A Mobile Ad-hoc NETwork(MANET)contains numerous mobile nodes,and it forms a structure-less network associated with wireless links.But,the node movement is the key feature of MANETs;hence,the quick action of the nodes guides a link failure.This link failure creates more data packet drops that can cause a long time delay.As a result,measuring accurate link failure time is the key factor in the MANET.This paper presents a Fuzzy Linear Regression Method to measure Link Failure(FLRLF)and provide an optimal route in the MANET-Internet of Things(IoT).This work aims to predict link failure and improve routing efficiency in MANET.The Fuzzy Linear Regression Method(FLRM)measures the long lifespan link based on the link failure.The mobile node group is built by the Received Signal Strength(RSS).The Hill Climbing(HC)method selects the Group Leader(GL)based on node mobility,node degree and node energy.Additionally,it uses a Data Gathering node forward the infor-mation from GL to the sink node through multiple GL.The GL is identified by linking lifespan and energy using the Particle Swarm Optimization(PSO)algo-rithm.The simulation results demonstrate that the FLRLF approach increases the GL lifespan and minimizes the link failure time in the MANET.
文摘The development of prediction supports is a critical step in information systems engineering in this era defined by the knowledge economy, the hub of which is big data. Currently, the lack of a predictive model, whether qualitative or quantitative, depending on a company’s areas of intervention can handicap or weaken its competitive capacities, endangering its survival. In terms of quantitative prediction, depending on the efficacy criteria, a variety of methods and/or tools are available. The multiple linear regression method is one of the methods used for this purpose. A linear regression model is a regression model of an explained variable on one or more explanatory variables in which the function that links the explanatory variables to the explained variable has linear parameters. The purpose of this work is to demonstrate how to use multiple linear regressions, which is one aspect of decisional mathematics. The use of multiple linear regressions on random data, which can be replaced by real data collected by or from organizations, provides decision makers with reliable data knowledge. As a result, machine learning methods can provide decision makers with relevant and trustworthy data. The main goal of this article is therefore to define the objective function on which the influencing factors for its optimization will be defined using the linear regression method.
基金This work was supported by the 2021 Project of the“14th Five-Year Plan”of Shaanxi Education Science“Research on the Application of Educational Data Mining in Applied Undergraduate Teaching-Taking the Course of‘Computer Application Technology’as an Example”(SGH21Y0403)the Teaching Reform and Research Projects for Practical Teaching in 2022“Research on Practical Teaching of Applied Undergraduate Projects Based on‘Combination of Courses and Certificates”-Taking Computer Application Technology Courses as an Example”(SJJG02012)the 11th batch of Teaching Reform Research Project of Xi’an Jiaotong University City College“Project-Driven Cultivation and Research on Information Literacy of Applied Undergraduate Students in the Information Times-Taking Computer Application Technology Course Teaching as an Example”(111001).
文摘Social network is the mainstream medium of current information dissemination,and it is particularly important to accurately predict its propagation law.In this paper,we introduce a social network propagation model integrating multiple linear regression and infectious disease model.Firstly,we proposed the features that affect social network communication from three dimensions.Then,we predicted the node influence via multiple linear regression.Lastly,we used the node influence as the state transition of the infectious disease model to predict the trend of information dissemination in social networks.The experimental results on a real social network dataset showed that the prediction results of the model are consistent with the actual information dissemination trends.
基金supported by the National Nature Science Foundation of China (Grant Noss 40739907 and 40774064)National Science and Technology Major Project (Grant No. 2008ZX05025-003)
文摘The rock matrix bulk modulus or its inverse, the compressive coefficient, is an important input parameter for fluid substitution by the Biot-Gassmann equation in reservoir prediction. However, it is not easy to accurately estimate the bulk modulus by using conventional methods. In this paper, we present a new linear regression equation for calculating the parameter. In order to get this equation, we first derive a simplified Gassmann equation by using a reasonable assumption in which the compressive coefficient of the saturated pore fluid is much greater than the rock matrix, and, second, we use the Eshelby- Walsh relation to replace the equivalent modulus of a dry rock in the Gassmann equation. Results from the rock physics analysis of rock sample from a carbonate area show that rock matrix compressive coefficients calculated with water-saturated and dry rock samples using the linear regression method are very close (their error is less than 1%). This means the new method is accurate and reliable.
基金Project(ZDRW-ZS-2021-3)supported by the Key Deployment Projects of Chinese Academy of SciencesProjects(52179116,51991392)supported by the National Natural Science Foundation of China。
文摘In the process of shield tunneling through soft soil layers,the presence of confined water ahead poses a significant threat to the stability of the tunnel face.Therefore,it is crucial to consider the impact of confined water on the limit support pressure of the tunnel face.This study employed the finite element method(FEM)to analyze the limit support pressure of shield tunnel face instability within a pressurized water-containing layer.Subsequently,a multiple linear regression approach was applied to derive a concise solution formula for the limit support pressure,incorporating various influencing factors.The analysis yields the following conclusions:1)The influence of confined water on the instability mode of the tunnel face in soft soil layers makes the displacement response of the strata not significant when the face is unstable;2)The limit support pressure increases approximately linearly with the pressure head,shield tunnel diameter,and tunnel burial depth.And inversely proportional to the thickness of the impermeable layer,soil cohesion and internal friction angle;3)Through an engineering case study analysis,the results align well with those obtained from traditional theoretical methods,thereby validating the rationality of the equations proposed in this paper.Furthermore,the proposed equations overcome the limitation of traditional theoretical approaches considering the influence of changes in impermeable layer thickness.It can accurately depict the dynamic variation in the required limit support pressure to maintain the stability of the tunnel face during shield tunneling,thus better reflecting engineering reality.
文摘Accurate software cost estimation in Global Software Development(GSD)remains challenging due to reliance on historical data and expert judgments.Traditional models,such as the Constructive Cost Model(COCOMO II),rely heavily on historical and accurate data.In addition,expert judgment is required to set many input parameters,which can introduce subjectivity and variability in the estimation process.Consequently,there is a need to improve the current GSD models to mitigate reliance on historical data,subjectivity in expert judgment,inadequate consideration of GSD-based cost drivers and limited integration of modern technologies with cost overruns.This study introduces a novel hybrid model that synergizes the COCOMO II with Artificial Neural Networks(ANN)to address these challenges.The proposed hybrid model integrates additional GSD-based cost drivers identified through a systematic literature review and further vetted by industry experts.This article compares the effectiveness of the proposedmodelwith state-of-the-artmachine learning-basedmodels for software cost estimation.Evaluating the NASA 93 dataset by adopting twenty-six GSD-based cost drivers reveals that our hybrid model achieves superior accuracy,outperforming existing state-of-the-artmodels.The findings indicate the potential of combining COCOMO II,ANN,and additional GSD-based cost drivers to transform cost estimation in GSD.
文摘In oil and gas exploration,elucidating the complex interdependencies among geological variables is paramount.Our study introduces the application of sophisticated regression analysis method at the forefront,aiming not just at predicting geophysical logging curve values but also innovatively mitigate hydrocarbon depletion observed in geochemical logging.Through a rigorous assessment,we explore the efficacy of eight regression models,bifurcated into linear and nonlinear groups,to accommodate the multifaceted nature of geological datasets.Our linear model suite encompasses the Standard Equation,Ridge Regression,Least Absolute Shrinkage and Selection Operator,and Elastic Net,each presenting distinct advantages.The Standard Equation serves as a foundational benchmark,whereas Ridge Regression implements penalty terms to counteract overfitting,thus bolstering model robustness in the presence of multicollinearity.The Least Absolute Shrinkage and Selection Operator for variable selection functions to streamline models,enhancing their interpretability,while Elastic Net amalgamates the merits of Ridge Regression and Least Absolute Shrinkage and Selection Operator,offering a harmonized solution to model complexity and comprehensibility.On the nonlinear front,Gradient Descent,Kernel Ridge Regression,Support Vector Regression,and Piecewise Function-Fitting methods introduce innovative approaches.Gradient Descent assures computational efficiency in optimizing solutions,Kernel Ridge Regression leverages the kernel trick to navigate nonlinear patterns,and Support Vector Regression is proficient in forecasting extremities,pivotal for exploration risk assessment.The Piecewise Function-Fitting approach,tailored for geological data,facilitates adaptable modeling of variable interrelations,accommodating abrupt data trend shifts.Our analysis identifies Ridge Regression,particularly when augmented by Piecewise Function-Fitting,as superior in recouping hydrocarbon losses,and underscoring its utility in resource quantification refinement.Meanwhile,Kernel Ridge Regression emerges as a noteworthy strategy in ameliorating porosity-logging curve prediction for well A,evidencing its aptness for intricate geological structures.This research attests to the scientific ascendancy and broad-spectrum relevance of these regression techniques over conventional methods while heralding new horizons for their deployment in the oil and gas sector.The insights garnered from these advanced modeling strategies are set to transform geological and engineering practices in hydrocarbon prediction,evaluation,and recovery.
基金This research was funded by the National Natural Science Foundation of China(grant no.32271881).
文摘Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have developed from the logistic regression model,the geographical weighted logistic regression model,the Lasso regression model,the random forest model,and the support vector machine model based on historical forest fire data from 2000 to 2019 in Jilin Province.The models,along with a distribution map are presented in this paper to provide a theoretical basis for forest fire management in this area.Existing studies show that the prediction accuracies of the two machine learning models are higher than those of the three generalized linear regression models.The accuracies of the random forest model,the support vector machine model,geographical weighted logistic regression model,the Lasso regression model,and logistic model were 88.7%,87.7%,86.0%,85.0%and 84.6%,respectively.Weather is the main factor affecting forest fires,while the impacts of topography factors,human and social-economic factors on fire occurrence were similar.
基金financially supported by the Key R&D Program of Shaanxi Province(2023-YBSF-324)Shaanxi Provincial Department of Education Services Local Special Plan Project(23JC019)National Natural Science of Foundation of China(42267024).
文摘Gauge length influences the biomechanical properties of herbaceous roots such as tensile resistance,tensile strength and Young’s modulus.However,the extent to which and how these biomechanical properties of herbaceous roots are influenced remain unknown.To better understand the behavior of roots in tension under different conditions and to illustrate these behaviors,uniaxial tensile tests were conducted on the Poa araratica roots as the gauge length increased from 20 mm to 80 mm.Subsequently,ANOVA was used to test the impact of the significant influences of gauge length on the biomechanical properties,nonlinear regression was applied to establish the variation in the biomechanical properties with gauge length to answer the question of the extent to which the biomechanical properties are influenced,and Weibull models were subsequently introduced to illustrate how the biomechanical properties are influenced by gauge length.The results reveal that(1)the variation in biomechanical properties with root diameter depends on both the gauge length and the properties themselves;(2)the gauge length significantly impacts most of the biomechanical properties;(3)the tensile resistance,tensile strength,and tensile strain at cracks decrease as the gauge length increases,with values decreasing by 20%-300%,while Young’s modulus exhibits the opposite trend,with a corresponding increase of 30%;and(4)the Weibull distribution is suitable for describing the probability distribution of these biomechanical properties;the Weibull modulus for both tensile resistance and tensile strain at cracks linearly decrease with gauge length,whereas those for tensile strength and Young’s modulus exhibit the opposite trend.The tensile resistance,tensile strength,and tensile strain at the cracks linearly decrease with increasing gauge length,while the tensile strength and Young’s modulus linearly increase with increasing gauge length.
基金supported by the National Natural Science Foundation of China (41602205, 42293261)the China Geological Survey Program (DD20189506, DD20211301)+2 种基金the Special Investigation Project on Science and Technology Basic Resources of the Ministry of Science and Technology (2021FY101003)the Central Guidance for Local Scientific and Technological Development Fund of 2023the Project of Hebei University of Environmental Engineering (GCY202301)
文摘The change processes and trends of shoreline and tidal flat forced by human activities are essential issues for the sustainability of coastal area,which is also of great significance for understanding coastal ecological environment changes and even global changes.Based on field measurements,combined with Linear Regression(LR)model and Inverse Distance Weighing(IDW)method,this paper presents detailed analysis on the change history and trend of the shoreline and tidal flat in Bohai Bay.The shoreline faces a high erosion chance under the action of natural factors,while the tidal flat faces a different erosion and deposition patterns in Bohai Bay due to the impact of human activities.The implication of change rule for ecological protection and recovery is also discussed.Measures should be taken to protect the coastal ecological environment.The models used in this paper show a high correlation coefficient between observed and modeling data,which means that this method can be used to predict the changing trend of shoreline and tidal flat.The research results of present study can provide scientific supports for future coastal protection and management.
基金funded by the National Social Science Fund of China(Grant No.23BTJ069).
文摘Based on China Family Panel Studies(CFPS)2018 data,the multiple linear regression model is used to analyze the effects of Internet use on women’s depression,and to test the robustness of the regression results.At the same time,the effects of Internet use on mental health of women with different residence,age,marital status and physical health status are analyzed.Then,we can obtain that Internet use has a significant promoting effect on women’s mental health,while the degree of Internet use has a significant inhibitory effect on women’s mental health.In addition,the study found that women’s age,education,place of residence,marital status,length of sleep,working status and physical health status are the main factors affecting the mental health of Chinese women.In the heterogeneity investigation of residence,age,marital status and physical health status,Internet use has a greater negative impact on the Center for Epidemiological Studies Depression Scale(CES-D8)scores of women in rural areas,has a significant positive impact on the mental health of middle-aged and elderly women or women with spouses,and has a positive impact on the mental health of physically unhealthy women.Therefore,in view of women’s mental health needs and the problems existing in the use of the Internet,this paper puts forward some suggestions to further improve the overall mental health level of women.
基金funded by the National Key Research and Development Program of China(No.2022YFD2200503-02)。
文摘The diameter distribution function(DDF)is a crucial tool for accurately predicting stand carbon storage(CS).The current key issue,however,is how to construct a high-precision DDF based on stand factors,site quality,and aridity index to predict stand CS in multi-species mixed forests with complex structures.This study used data from70 survey plots for mixed broadleaf Populus davidiana and Betula platyphylla forests in the Mulan Rangeland State Forest,Hebei Province,China,to construct the DDF based on maximum likelihood estimation and finite mixture model(FMM).Ordinary least squares(OLS),linear seemingly unrelated regression(LSUR),and back propagation neural network(BPNN)were used to investigate the influences of stand factors,site quality,and aridity index on the shape and scale parameters of DDF and predicted stand CS of mixed broadleaf forests.The results showed that FMM accurately described the stand-level diameter distribution of the mixed P.davidiana and B.platyphylla forests;whereas the Weibull function constructed by MLE was more accurate in describing species-level diameter distribution.The combined variable of quadratic mean diameter(Dq),stand basal area(BA),and site quality improved the accuracy of the shape parameter models of FMM;the combined variable of Dq,BA,and De Martonne aridity index improved the accuracy of the scale parameter models.Compared to OLS and LSUR,the BPNN had higher accuracy in the re-parameterization process of FMM.OLS,LSUR,and BPNN overestimated the CS of P.davidiana but underestimated the CS of B.platyphylla in the large diameter classes(DBH≥18 cm).BPNN accurately estimated stand-and species-level CS,but it was more suitable for estimating stand-level CS compared to species-level CS,thereby providing a scientific basis for the optimization of stand structure and assessment of carbon sequestration capacity in mixed broadleaf forests.
文摘Climate change has been a matter of discourse for the last several decades. Much research has been conducted regarding the causes and impacts of climate change around the world. The current research contributes to the knowledge of the influence of climate change on our environment, with emphasis on earthquake occurrences in the region of Indonesia. Using global temperature anomaly as a measure of climate change, and earthquake data in Indonesia for the period 1900-2022, the paper seeks to find a relationship (if any) between the two variables. Statistical methods used include normal distribution analysis, linear regression and correlation test. The results show peculiar patterns in the progression of earthquake occurrences as well as global temperature anomaly occurring in the same time periods. The findings also indicated that the magnitudes of earthquakes remained unaffected by global temperature anomalies over the years. Nonetheless, there appears to be a potential correlation between temperature anomalies and the frequency of earthquake occurrences. As per the results, an increase in temperature anomaly is associated with a higher frequency of earthquakes.
文摘The embracing of ICTs and related technologies has enhanced different approaches for governments worldwide to deliver services to their citizens in a smart way. However, the usage of e-government services by common citizens is recognized as one of the major setbacks of e-government development in both developed and developing countries. Moreover, government agencies in these countries are facing great challenges in keeping the citizens motivated enough to continue to use e-government services. This research aims to investigate the factors that influence citizens’ trust towards continue use of e-government services in Cameroon. The proposed research model consisted of three main constructs including technological, governmental, risk factors as well as six demographic characteristics (age, gender, educational level, income, internet experience and cultural perception). A five-point Likert scale questionnaire was designed to collect data physically and electronically, 352 valid questionnaires were retrieved. Simple and Multiple regression analysis methods were applied to build an adequate model based on the verification of hypotheses proposed. Based on results obtained, four demographic characteristics (age, education, occupation and income) have influence on citizens’ trust in e-government meanwhile gender and cultural affiliation have no influence. Furthermore, technological factors and governmental factors positively influence trust level in e-government, whereas risk factors have a negative influence on trust level. Deducing from the results, a list of recommendations is proposed to the government of Cameroon in order to reinforce citizens’ trust in e-government services.
文摘Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear model is the most used technique for identifying hidden relationships between underlying random variables of interest. However, data quality is a significant challenge in machine learning, especially when missing data is present. The linear regression model is a commonly used statistical modeling technique used in various applications to find relationships between variables of interest. When estimating linear regression parameters which are useful for things like future prediction and partial effects analysis of independent variables, maximum likelihood estimation (MLE) is the method of choice. However, many datasets contain missing observations, which can lead to costly and time-consuming data recovery. To address this issue, the expectation-maximization (EM) algorithm has been suggested as a solution for situations including missing data. The EM algorithm repeatedly finds the best estimates of parameters in statistical models that depend on variables or data that have not been observed. This is called maximum likelihood or maximum a posteriori (MAP). Using the present estimate as input, the expectation (E) step constructs a log-likelihood function. Finding the parameters that maximize the anticipated log-likelihood, as determined in the E step, is the job of the maximization (M) phase. This study looked at how well the EM algorithm worked on a made-up compositional dataset with missing observations. It used both the robust least square version and ordinary least square regression techniques. The efficacy of the EM algorithm was compared with two alternative imputation techniques, k-Nearest Neighbor (k-NN) and mean imputation (), in terms of Aitchison distances and covariance.
文摘In basketball, each player’s skill level is the key to a team’s success or failure, the skill level is affected by many personal and environmental factors. A physics-informed AI statistics has become extremely important. In this article, a complex non-linear process is considered by taking into account the average points per game of each player, playing time, shooting percentage, and others. This physics-informed statistics is to construct a multiple linear regression model with physics-informed neural networks. Based on the official data provided by the American Basketball League, and combined with specific methods of R program analysis, the regression model affecting the player’s average points per game is verified, and the key factors affecting the player’s average points per game are finally elucidated. The paper provides a novel window for coaches to make meaningful in-game adjustments to team members.