Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for rep...Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.展开更多
In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluste...In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.展开更多
BACKGROUND The spread of the severe acute respiratory syndrome coronavirus 2 outbreak worldwide has caused concern regarding the mortality rate caused by the infection.The determinants of mortality on a global scale c...BACKGROUND The spread of the severe acute respiratory syndrome coronavirus 2 outbreak worldwide has caused concern regarding the mortality rate caused by the infection.The determinants of mortality on a global scale cannot be fully understood due to lack of information.AIM To identify key factors that may explain the variability in case lethality across countries.METHODS We identified 21 Potential risk factors for coronavirus disease 2019(COVID-19)case fatality rate for all the countries with available data.We examined univariate relationships of each variable with case fatality rate(CFR),and all independent variables to identify candidate variables for our final multiple model.Multiple regression analysis technique was used to assess the strength of relationship.RESULTS The mean of COVID-19 mortality was 1.52±1.72%.There was a statistically significant inverse correlation between health expenditure,and number of computed tomography scanners per 1 million with CFR,and significant direct correlation was found between literacy,and air pollution with CFR.This final model can predict approximately 97%of the changes in CFR.CONCLUSION The current study recommends some new predictors explaining affect mortality rate.Thus,it could help decision-makers develop health policies to fight COVID-19.展开更多
BACKGROUND Radiation pneumonitis(RP)is a severe complication of thoracic radiotherapy that may lead to dyspnea and lung fibrosis,and negatively affects patients’quality of life.AIM To carry out multiple regression an...BACKGROUND Radiation pneumonitis(RP)is a severe complication of thoracic radiotherapy that may lead to dyspnea and lung fibrosis,and negatively affects patients’quality of life.AIM To carry out multiple regression analysis on the influencing factors of radiation pneumonitis.METHODS Records of 234 patients receiving chest radiotherapy in Huzhou Central Hospital(Huzhou,Zhejiang Province,China)from January 2018 to February 2021,and the patients were divided into either a study group or a control group based on the presence of radiation pneumonitis or not.Among them,93 patients with radiation pneumonitis were included in the study group and 141 without radiation pneumonitis were included in the control group.General characteristics,and radiation and imaging examination data of the two groups were collected and compared.Due to the statistical significance observed,multiple regression analysis was performed on age,tumor type,chemotherapy history,forced vital capacity(FVC),forced expiratory volume in the first second(FEV1),carbon monoxide diffusion volume(DLCO),FEV1/FVC ratio,planned target area(PTV),mean lung dose(MLD),total number of radiation fields,percentage of lung tissue in total lung volume(vdose),probability of normal tissue complications(NTCP),and other factors.RESULTS The proportions of patients aged≥60 years and those with the diagnosis of lung cancer and a history of chemotherapy in the study group were higher than those in the control group(P<0.05);FEV1,DLCO,and FEV1/FVC ratio in the study group were lower than those in the control group(P<0.05),while PTV,MLD,total field number,vdose,and NTCP were higher than in the control group(P<0.05).Logistic regression analysis showed that age,lung cancer diagnosis,chemotherapy history,FEV1,FEV1/FVC ratio,PTV,MLD,total number of radiation fields,vdose,and NTCP were risk factors for radiation pneumonitis.CONCLUSION We have identified patient age,type of lung cancer,history of chemotherapy,lung function,and radiotherapy parameters as risk factors for radiation pneumonitis.Comprehensive evaluation and examination should be carried out before radiotherapy to effectively prevent radiation pneumonitis.展开更多
Machine Learning(ML)has changed clinical diagnostic procedures drastically.Especially in Cardiovascular Diseases(CVD),the use of ML is indispensable to reducing human errors.Enormous studies focused on disease predict...Machine Learning(ML)has changed clinical diagnostic procedures drastically.Especially in Cardiovascular Diseases(CVD),the use of ML is indispensable to reducing human errors.Enormous studies focused on disease prediction but depending on multiple parameters,further investigations are required to upgrade the clinical procedures.Multi-layered implementation of ML also called Deep Learning(DL)has unfolded new horizons in the field of clinical diagnostics.DL formulates reliable accuracy with big datasets but the reverse is the case with small datasets.This paper proposed a novel method that deals with the issue of less data dimensionality.Inspired by the regression analysis,the proposed method classifies the data by going through three different stages.In the first stage,feature representation is converted into probabilities using multiple regression techniques,the second stage grasps the probability conclusions from the previous stage and the third stage fabricates the final classifications.Extensive experiments were carried out on the Cleveland heart disease dataset.The results show significant improvement in classification accuracy.It is evident from the comparative results of the paper that the prevailing statistical ML methods are no more stagnant disease prediction techniques in demand in the future.展开更多
Rapidly spreading COVID-19 virus and its variants, especially in metropolitan areas around the world, became a major health public concern. The tendency of COVID-19 pandemic and statistical modelling represents an urg...Rapidly spreading COVID-19 virus and its variants, especially in metropolitan areas around the world, became a major health public concern. The tendency of COVID-19 pandemic and statistical modelling represents an urgent challenge in the United States for which there are few solutions. In this paper, we demonstrate combining Fourier terms for capturing seasonality with ARIMA errors and other dynamics in the data. Therefore, we have analyzed 156 weeks COVID-19 dataset on national level using Dynamic Harmonic Regression model, including simulation analysis and accuracy improvement from 2020 to 2023. Most importantly, we provide new advanced pathways which may serve as targets for developing new solutions and approaches.展开更多
BACKGROUND Endometrial cancer is one of the most commonly diagnosed gynecological cancers worldwide,and early-stage high-risk endometrial cancer has a poor prognosis.Adjuvant treatments after surgery,such as chemother...BACKGROUND Endometrial cancer is one of the most commonly diagnosed gynecological cancers worldwide,and early-stage high-risk endometrial cancer has a poor prognosis.Adjuvant treatments after surgery,such as chemotherapy and radiotherapy,have been widely used in clinical practice to improve patient survival.Medroxyprogesterone acetate is a synthetic progestogen that has been reported to have potential anticancer effects in endometrial cancer.However,its efficacy,safety,and longterm prognostic benefits as an adjuvant treatment for endometrial cancer remain controversial.Therefore,this study aimed to observe the efficacy and prognostic impact of adjuvant medroxyprogesterone acetate treatment in patients with earlystage high-risk endometrial cancer and evaluate its safety.AIM To observe the efficacy and prognosis of adjuvant treatment of endometrial cancer with medroxyprogesterone acetate and to evaluate its safety.METHODS We collected the clinical data of 200 patients with early-stage high-risk endometrial cancer who were admitted to the Department of Obstetrics and Gynecology of our hospital from January 2018 to December 2022.The control group(100 patients)underwent conventional surgical treatment,and the study group(100 patients)was administered adjuvant medroxyprogesterone acetate tablets on top of the control group.The Kaplan-Meier curve analysis and log-rank test were performed to determine the possible factors influencing the 5-year cumulative survival rate in the patients.The Cox regression analysis was performed to identify the factors influencing the survival prognosis of endometrial cancer.RESULTS According to the Cox regression analysis,age[hazard ratio(HR)=4.636,95%confidence interval(95%CI):1.411-15.237],pathological type(HR=6.943,95%CI:2.299-20.977),molecular typing(HR=5.789,95%CI:3.305-10.141),and myometrial infiltration(HR=5.768,95%CI:1.898-17.520)were factors influencing the prognosis of patients with early-stage high-risk endometrial cancer.CONCLUSION Age,pathological type,molecular typing,and myometrial infiltration were all relevant factors affecting the prognosis of early-stage high-risk endometrial cancer.The potential long-term prognostic benefit of adjuvant postoperative radiotherapy in patients with early-stage high-risk endometrial cancer is worthy of clinical consideration.展开更多
This paper studies the deterioration of bridge substructures utilizing the Long-Term Bridge Performance(LTBP)Program InfoBridge^(TM)and develops a survival model using Cox proportional hazards regression.The survival ...This paper studies the deterioration of bridge substructures utilizing the Long-Term Bridge Performance(LTBP)Program InfoBridge^(TM)and develops a survival model using Cox proportional hazards regression.The survival analysis is based on the National Bridge Inventory(NBI)dataset.The study calculates the survival rate of reinforced and prestressed concrete piles on bridges under marine conditions over a 29-year span(from 1992 to 2020).The state of Maryland is the primary focus of this study,with data from three neighboring regions,the District of Columbia,Virginia,and Delaware to expand the sample size.The data obtained from the National Bridge Inventory are condensed and filtered to acquire the most relevant information for model development.The Cox proportional hazards regression is applied to the condensed NBI data with six parameters:Age,ADT,ADTT,number of spans,span length,and structural length.Two survival models are generated for the bridge substructures:Reinforced and prestressed concrete piles in Maryland and reinforced and prestressed concrete piles in wet service conditions in the District of Columbia,Maryland,Delaware,and Virginia.Results from the Cox proportional hazards regression are used to construct Markov chains to demonstrate the sequence of the deterioration of bridge substructures.The Markov chains can be used as a tool to assist in the prediction and decision-making for repair,rehabilitation,and replacement of bridge piles.Based on the numerical model,the Pile Assessment Matrix Program(PAM)is developed to facilitate the assessment and maintenance of current bridge structures.The program integrates the NBI database with the inspection and research reports from various states’department of transportation,to serve as a tool for condition state simulation based on maintenance or rehabilitation strategies.展开更多
Objective:To investigate the trend of mortality by COVID-19 before and after the national vaccination program using joinpoint regression analysis from 19 February 2020 to 5 September 2022.Methods:In the present study,...Objective:To investigate the trend of mortality by COVID-19 before and after the national vaccination program using joinpoint regression analysis from 19 February 2020 to 5 September 2022.Methods:In the present study,a joinpoint regression analysis of monthly collected data on confirmed deaths of COVID-19 in Iran from February 19,2020 to September 5,2022 was performed.Results:After national vaccination in Iran,the trend of new monthly deaths due to COVID-19 was decreasing.The percentage of monthly changes from the beginning of the pandemic to the 19th month was 6.62%(95%CI:1.1,12.4),which had an increasing trend.From the 19th month to the end of the 31st month,the mortality trend was decreasing,and the percentage of monthly changes was-20.05%(95%CI:-8.3,-30.3)(P=0.002).The average percentage of monthly changes was-5%with a 95%CI of(-10.5,0.9).Conclusions:Along with other health measures,such as quarantine,wearing a mask,hand washing,social distancing,etc.,national vaccination significantly reduces the mortality rate of COVID-19.展开更多
Little research can be found in relation to the stability of anisotropic and heterogenous soils in three dimensions.In this paper,we propose a study on the three-dimensional(3D)undrained slopes in anisotropic and hete...Little research can be found in relation to the stability of anisotropic and heterogenous soils in three dimensions.In this paper,we propose a study on the three-dimensional(3D)undrained slopes in anisotropic and heterogenous clay using advanced upper and lower bounds finite element limit analysis(FELA).The obtained stability solutions are normalized,and presented by a stability number that is a function of three geometrical ratios and two material ratios,i.e.depth ratio,length ratio,slope angle,shear strength gradient ratio and anisotropic strength ratio.Numerical results are compared with experimental data in the literature,and charts are presented to cover a wide range of design parameters.Using the multivariate adaptive regression splines(MARS)analysis,the respective influence and sensitivity of each design parameter on the stability number and the failure mechanism are investigated.An empirical equation is also developed to effectively estimate the stability number.展开更多
The use of Amazon Web Services is growing rapidly as more users are adopting the technology.It has various functionalities that can be used by large corporates and individuals as well.Sentiment analysis is used to bui...The use of Amazon Web Services is growing rapidly as more users are adopting the technology.It has various functionalities that can be used by large corporates and individuals as well.Sentiment analysis is used to build an intelligent system that can study the opinions of the people and help to classify those related emotions.In this research work,sentiment analysis is performed on the AWS Elastic Compute Cloud(EC2)through Twitter data.The data is managed to the EC2 by using elastic load balancing.The collected data is subjected to preprocessing approaches to clean the data,and then machine learning-based logistic regression is employed to categorize the sentiments into positive and negative sentiments.High accuracy of 94.17%is obtained through the proposed machine learning model which is higher than the other models that are developed using the existing algorithms.展开更多
Internet of Things(IoT)is a popular social network in which devices are virtually connected for communicating and sharing information.This is applied greatly in business enterprises and government sectors for deliveri...Internet of Things(IoT)is a popular social network in which devices are virtually connected for communicating and sharing information.This is applied greatly in business enterprises and government sectors for delivering the services to their customers,clients and citizens.But,the interaction is success-ful only based on the trust that each device has on another.Thus trust is very much essential for a social network.As Internet of Things have access over sen-sitive information,it urges to many threats that lead data management to risk.This issue is addressed by trust management that help to take decision about trust-worthiness of requestor and provider before communication and sharing.Several trust-based systems are existing for different domain using Dynamic weight meth-od,Fuzzy classification,Bayes inference and very few Regression analysis for IoT.The proposed algorithm is based on Logistic Regression,which provide strong statistical background to trust prediction.To make our stand strong on regression support to trust,we have compared the performance with equivalent sound Bayes analysis using Beta distribution.The performance is studied in simu-lated IoT setup with Quality of Service(QoS)and Social parameters for the nodes.The proposed model performs better in terms of various metrics.An IoT connects heterogeneous devices such as tags and sensor devices for sharing of information and avail different application services.The most salient features of IoT system is to design it with scalability,extendibility,compatibility and resiliency against attack.The existing worksfinds a way to integrate direct and indirect trust to con-verge quickly and estimate the bias due to attacks in addition to the above features.展开更多
Predictive Emission Monitoring Systems (PEMS) offer a cost-effective and environmentally friendly alternative to Continuous Emission Monitoring Systems (CEMS) for monitoring pollution from industrial sources. Multiple...Predictive Emission Monitoring Systems (PEMS) offer a cost-effective and environmentally friendly alternative to Continuous Emission Monitoring Systems (CEMS) for monitoring pollution from industrial sources. Multiple regression is one of the fundamental statistical techniques to describe the relationship between dependent and independent variables. This model can be effectively used to develop a PEMS, to estimate the amount of pollution emitted by industrial sources, where the fuel composition and other process-related parameters are available. It often makes them sufficient to predict the emission discharge with acceptable accuracy. In cases where PEMS are accepted as an alternative method to CEMS, which use gas analyzers, they can provide cost savings and substantial benefits for ongoing system support and maintenance. The described mathematical concept is based on the matrix algebra representation in multiple regression involving multiple precision arithmetic techniques. Challenging numerical examples for statistical big data analysis, are investigated. Numerical examples illustrate computational accuracy and efficiency of statistical analysis due to increasing the precision level. The programming language C++ is used for mathematical model implementation. The data for research and development, including the dependent fuel and independent NOx emissions data, were obtained from CEMS software installed on a petrochemical plant.展开更多
In this paper, firstly, we propose a new method for choosing regularization parameter λ for lasso regression, which differs from traditional method such as multifold cross-validation, our new method gives the maximum...In this paper, firstly, we propose a new method for choosing regularization parameter λ for lasso regression, which differs from traditional method such as multifold cross-validation, our new method gives the maximum value of parameter λ directly. Secondly, by considering another prior form over model space in the Bayes approach, we propose a new extended Bayes information criterion family, and under some mild condition, our new EBIC (NEBIC) is shown to be consistent. Then we apply our new method to choose parameter for sequential lasso regression which selects features by sequentially solving partially penalized least squares problems where the features selected in earlier steps are not penalized in the subsequent steps. Then sequential lasso uses NEBIC as the stopping rule. Finally, we apply our algorithm to identify the nonzero entries of precision matrix for high-dimensional linear discrimination analysis. Simulation results demonstrate that our algorithm has a lower misclassification rate and less computation time than its competing methods under considerations.展开更多
The interception probability of a single missile is the basis for combat plan design and weapon performance evaluation,while its influencing factors are complex and mutually coupled.Existing calculation methods have v...The interception probability of a single missile is the basis for combat plan design and weapon performance evaluation,while its influencing factors are complex and mutually coupled.Existing calculation methods have very limited analysis of the influence mechanism of influencing factors,and none of them has analyzed the influence of the guidance law.This paper considers the influencing factors of both the interceptor and the target more comprehensively.Interceptor parameters include speed,guidance law,guidance error,fuze error,and fragment killing ability,while target performance includes speed,maneuverability,and vulnerability.In this paper,an interception model is established,Monte Carlo simulation is carried out,and the influence mechanism of each factor is analyzed based on the model and simulation results.Finally,this paper proposes a classification-regression neural network to quickly estimate the interception probability based on the value of influencing factors.The proposed method reduces the interference of invalid interception data to valid data,so its prediction accuracy is significantly better than that of pure regression neural networks.展开更多
This paper presents a case study on the IPUMS NHIS database,which provides data from censuses and surveys on the health of the U.S.population,including data related to COVID-19.By addressing gaps in previous studies,w...This paper presents a case study on the IPUMS NHIS database,which provides data from censuses and surveys on the health of the U.S.population,including data related to COVID-19.By addressing gaps in previous studies,we propose a machine learning approach to train predictive models for identifying and measuring factors that affect the severity of COVID-19 symptoms.Our experiments focus on four groups of factors:demographic,socio-economic,health condition,and related to COVID-19 vaccination.By analysing the sensitivity of the variables used to train the models and the VEC(variable effect characteristics)analysis on the variable values,we identify and measure importance of various factors that influence the severity of COVID-19 symptoms.展开更多
[Objectives]The purpose of this study was to provide reference for cultivation and promotion of a new sugarcane variety Yuetang 03-373,on the basis of analyzing and summarizing the characters of the variety.[Methods]C...[Objectives]The purpose of this study was to provide reference for cultivation and promotion of a new sugarcane variety Yuetang 03-373,on the basis of analyzing and summarizing the characters of the variety.[Methods]Correlation,multiple regression and path analyses were performed for the yield and yield components of Yuetang 03-373.[Results]Correlation analysis shows that cane yield was significantly correlated with millable stalk number,stalk length and stalk diameter,and among them,the correlation with millable stalk number was the strongest.Multiple regression and path analyses show that millable stalk number contributed the most to cane yield,followed by stalk length,and stalk diameter contributed the least.The regression equation of cane yield against the three yield components was y=-2.8713+1.5497x1+5.8990x2-395.4294x3(R=0.9672**).[Conclusions]Millable stalk number and stalk length were the important and major factors for high yield of Yuetang 03-373,indicating that Yuetang 03-373 is a sugarcane variety of millable stalk type.In cultivation,full play should be given to the advantage of Yuetang 03-373 in millable stalk number,as well as stalk length(plant height),in order to achieve the purpose of increasing yield.展开更多
A multivariable regression analysis of the in-situ stress field, which considers the non-linear deformation behavior of faults in practical projects, is presented based on a newly developed three-dimensional displacem...A multivariable regression analysis of the in-situ stress field, which considers the non-linear deformation behavior of faults in practical projects, is presented based on a newly developed three-dimensional displacement discontinuity method (DDM) program. The Bar- ton-Bandis model and the Kulhaway model are adopted as the normal and the tangential deformation model of faults, respectively, where the Mohr-Coulomb failure criterion is satisfied. In practical projects, the values of the mechanical parameters of rock and faults are restricted in a bounded range for in-situ test, and the optimal mechanical parameters are obtained from this range by a loop. Comparing with the traditional finite element method (FEM), the DDM regression results are more accurate.展开更多
This paper presents an analysis to forecast the loads of an isolated area where the history of load is not available or the history may not represent the realistic demand of electricity. The analysis is done through l...This paper presents an analysis to forecast the loads of an isolated area where the history of load is not available or the history may not represent the realistic demand of electricity. The analysis is done through linear regression and based on the identification of factors on which electrical load growth depends. To determine the identification factors, areas are selected whose histories of load growth rate known and the load growth deciding factors are similar to those of the isolated area. The proposed analysis is applied to an isolated area of Bangladesh, called Swandip where a past history of electrical load demand is not available and also there is no possibility of connecting the area with the main land grid system.展开更多
The blast-induced ground vibration prediction using scaled distance regression analysis is one of the most popular methods employed by engineers for many decades. It uses the maximum charge per delay and distance of m...The blast-induced ground vibration prediction using scaled distance regression analysis is one of the most popular methods employed by engineers for many decades. It uses the maximum charge per delay and distance of monitoring as the major factors for predicting the peak particle velocity(PPV). It is established that the PPV is caused by the maximum charge per delay which varies with the distance of monitoring and site geology. While conducting a production blasting, the waves induced by blasting of different holes interfere destructively with each other, which may result in higher PPV than the predicted value with scaled distance regression analysis. This phenomenon of interference/superimposition of waves is not considered while using scaled distance regression analysis. In this paper, an attempt has been made to compare the predicted values of blast-induced ground vibration using multi-hole trial blasting with single-hole blasting in an opencast coal mine under the same geological condition. Further,the modified prediction equation for the multi-hole trial blasting was obtained using single-hole regression analysis. The error between predicted and actual values of multi-hole blast-induced ground vibration was found to be reduced by 8.5%.展开更多
文摘Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.
文摘In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.
文摘BACKGROUND The spread of the severe acute respiratory syndrome coronavirus 2 outbreak worldwide has caused concern regarding the mortality rate caused by the infection.The determinants of mortality on a global scale cannot be fully understood due to lack of information.AIM To identify key factors that may explain the variability in case lethality across countries.METHODS We identified 21 Potential risk factors for coronavirus disease 2019(COVID-19)case fatality rate for all the countries with available data.We examined univariate relationships of each variable with case fatality rate(CFR),and all independent variables to identify candidate variables for our final multiple model.Multiple regression analysis technique was used to assess the strength of relationship.RESULTS The mean of COVID-19 mortality was 1.52±1.72%.There was a statistically significant inverse correlation between health expenditure,and number of computed tomography scanners per 1 million with CFR,and significant direct correlation was found between literacy,and air pollution with CFR.This final model can predict approximately 97%of the changes in CFR.CONCLUSION The current study recommends some new predictors explaining affect mortality rate.Thus,it could help decision-makers develop health policies to fight COVID-19.
文摘BACKGROUND Radiation pneumonitis(RP)is a severe complication of thoracic radiotherapy that may lead to dyspnea and lung fibrosis,and negatively affects patients’quality of life.AIM To carry out multiple regression analysis on the influencing factors of radiation pneumonitis.METHODS Records of 234 patients receiving chest radiotherapy in Huzhou Central Hospital(Huzhou,Zhejiang Province,China)from January 2018 to February 2021,and the patients were divided into either a study group or a control group based on the presence of radiation pneumonitis or not.Among them,93 patients with radiation pneumonitis were included in the study group and 141 without radiation pneumonitis were included in the control group.General characteristics,and radiation and imaging examination data of the two groups were collected and compared.Due to the statistical significance observed,multiple regression analysis was performed on age,tumor type,chemotherapy history,forced vital capacity(FVC),forced expiratory volume in the first second(FEV1),carbon monoxide diffusion volume(DLCO),FEV1/FVC ratio,planned target area(PTV),mean lung dose(MLD),total number of radiation fields,percentage of lung tissue in total lung volume(vdose),probability of normal tissue complications(NTCP),and other factors.RESULTS The proportions of patients aged≥60 years and those with the diagnosis of lung cancer and a history of chemotherapy in the study group were higher than those in the control group(P<0.05);FEV1,DLCO,and FEV1/FVC ratio in the study group were lower than those in the control group(P<0.05),while PTV,MLD,total field number,vdose,and NTCP were higher than in the control group(P<0.05).Logistic regression analysis showed that age,lung cancer diagnosis,chemotherapy history,FEV1,FEV1/FVC ratio,PTV,MLD,total number of radiation fields,vdose,and NTCP were risk factors for radiation pneumonitis.CONCLUSION We have identified patient age,type of lung cancer,history of chemotherapy,lung function,and radiotherapy parameters as risk factors for radiation pneumonitis.Comprehensive evaluation and examination should be carried out before radiotherapy to effectively prevent radiation pneumonitis.
文摘Machine Learning(ML)has changed clinical diagnostic procedures drastically.Especially in Cardiovascular Diseases(CVD),the use of ML is indispensable to reducing human errors.Enormous studies focused on disease prediction but depending on multiple parameters,further investigations are required to upgrade the clinical procedures.Multi-layered implementation of ML also called Deep Learning(DL)has unfolded new horizons in the field of clinical diagnostics.DL formulates reliable accuracy with big datasets but the reverse is the case with small datasets.This paper proposed a novel method that deals with the issue of less data dimensionality.Inspired by the regression analysis,the proposed method classifies the data by going through three different stages.In the first stage,feature representation is converted into probabilities using multiple regression techniques,the second stage grasps the probability conclusions from the previous stage and the third stage fabricates the final classifications.Extensive experiments were carried out on the Cleveland heart disease dataset.The results show significant improvement in classification accuracy.It is evident from the comparative results of the paper that the prevailing statistical ML methods are no more stagnant disease prediction techniques in demand in the future.
文摘Rapidly spreading COVID-19 virus and its variants, especially in metropolitan areas around the world, became a major health public concern. The tendency of COVID-19 pandemic and statistical modelling represents an urgent challenge in the United States for which there are few solutions. In this paper, we demonstrate combining Fourier terms for capturing seasonality with ARIMA errors and other dynamics in the data. Therefore, we have analyzed 156 weeks COVID-19 dataset on national level using Dynamic Harmonic Regression model, including simulation analysis and accuracy improvement from 2020 to 2023. Most importantly, we provide new advanced pathways which may serve as targets for developing new solutions and approaches.
文摘BACKGROUND Endometrial cancer is one of the most commonly diagnosed gynecological cancers worldwide,and early-stage high-risk endometrial cancer has a poor prognosis.Adjuvant treatments after surgery,such as chemotherapy and radiotherapy,have been widely used in clinical practice to improve patient survival.Medroxyprogesterone acetate is a synthetic progestogen that has been reported to have potential anticancer effects in endometrial cancer.However,its efficacy,safety,and longterm prognostic benefits as an adjuvant treatment for endometrial cancer remain controversial.Therefore,this study aimed to observe the efficacy and prognostic impact of adjuvant medroxyprogesterone acetate treatment in patients with earlystage high-risk endometrial cancer and evaluate its safety.AIM To observe the efficacy and prognosis of adjuvant treatment of endometrial cancer with medroxyprogesterone acetate and to evaluate its safety.METHODS We collected the clinical data of 200 patients with early-stage high-risk endometrial cancer who were admitted to the Department of Obstetrics and Gynecology of our hospital from January 2018 to December 2022.The control group(100 patients)underwent conventional surgical treatment,and the study group(100 patients)was administered adjuvant medroxyprogesterone acetate tablets on top of the control group.The Kaplan-Meier curve analysis and log-rank test were performed to determine the possible factors influencing the 5-year cumulative survival rate in the patients.The Cox regression analysis was performed to identify the factors influencing the survival prognosis of endometrial cancer.RESULTS According to the Cox regression analysis,age[hazard ratio(HR)=4.636,95%confidence interval(95%CI):1.411-15.237],pathological type(HR=6.943,95%CI:2.299-20.977),molecular typing(HR=5.789,95%CI:3.305-10.141),and myometrial infiltration(HR=5.768,95%CI:1.898-17.520)were factors influencing the prognosis of patients with early-stage high-risk endometrial cancer.CONCLUSION Age,pathological type,molecular typing,and myometrial infiltration were all relevant factors affecting the prognosis of early-stage high-risk endometrial cancer.The potential long-term prognostic benefit of adjuvant postoperative radiotherapy in patients with early-stage high-risk endometrial cancer is worthy of clinical consideration.
基金This research receives funding from the Maryland Department of Transportation State Highway Administration.
文摘This paper studies the deterioration of bridge substructures utilizing the Long-Term Bridge Performance(LTBP)Program InfoBridge^(TM)and develops a survival model using Cox proportional hazards regression.The survival analysis is based on the National Bridge Inventory(NBI)dataset.The study calculates the survival rate of reinforced and prestressed concrete piles on bridges under marine conditions over a 29-year span(from 1992 to 2020).The state of Maryland is the primary focus of this study,with data from three neighboring regions,the District of Columbia,Virginia,and Delaware to expand the sample size.The data obtained from the National Bridge Inventory are condensed and filtered to acquire the most relevant information for model development.The Cox proportional hazards regression is applied to the condensed NBI data with six parameters:Age,ADT,ADTT,number of spans,span length,and structural length.Two survival models are generated for the bridge substructures:Reinforced and prestressed concrete piles in Maryland and reinforced and prestressed concrete piles in wet service conditions in the District of Columbia,Maryland,Delaware,and Virginia.Results from the Cox proportional hazards regression are used to construct Markov chains to demonstrate the sequence of the deterioration of bridge substructures.The Markov chains can be used as a tool to assist in the prediction and decision-making for repair,rehabilitation,and replacement of bridge piles.Based on the numerical model,the Pile Assessment Matrix Program(PAM)is developed to facilitate the assessment and maintenance of current bridge structures.The program integrates the NBI database with the inspection and research reports from various states’department of transportation,to serve as a tool for condition state simulation based on maintenance or rehabilitation strategies.
文摘Objective:To investigate the trend of mortality by COVID-19 before and after the national vaccination program using joinpoint regression analysis from 19 February 2020 to 5 September 2022.Methods:In the present study,a joinpoint regression analysis of monthly collected data on confirmed deaths of COVID-19 in Iran from February 19,2020 to September 5,2022 was performed.Results:After national vaccination in Iran,the trend of new monthly deaths due to COVID-19 was decreasing.The percentage of monthly changes from the beginning of the pandemic to the 19th month was 6.62%(95%CI:1.1,12.4),which had an increasing trend.From the 19th month to the end of the 31st month,the mortality trend was decreasing,and the percentage of monthly changes was-20.05%(95%CI:-8.3,-30.3)(P=0.002).The average percentage of monthly changes was-5%with a 95%CI of(-10.5,0.9).Conclusions:Along with other health measures,such as quarantine,wearing a mask,hand washing,social distancing,etc.,national vaccination significantly reduces the mortality rate of COVID-19.
文摘Little research can be found in relation to the stability of anisotropic and heterogenous soils in three dimensions.In this paper,we propose a study on the three-dimensional(3D)undrained slopes in anisotropic and heterogenous clay using advanced upper and lower bounds finite element limit analysis(FELA).The obtained stability solutions are normalized,and presented by a stability number that is a function of three geometrical ratios and two material ratios,i.e.depth ratio,length ratio,slope angle,shear strength gradient ratio and anisotropic strength ratio.Numerical results are compared with experimental data in the literature,and charts are presented to cover a wide range of design parameters.Using the multivariate adaptive regression splines(MARS)analysis,the respective influence and sensitivity of each design parameter on the stability number and the failure mechanism are investigated.An empirical equation is also developed to effectively estimate the stability number.
基金This research project was supported by the Deanship of Scientific Research,Prince Sattam Bin Abdulaziz University,KSA,Project Grant No.2021/01/17783,Sha M,www.psau.edu.sa.
文摘The use of Amazon Web Services is growing rapidly as more users are adopting the technology.It has various functionalities that can be used by large corporates and individuals as well.Sentiment analysis is used to build an intelligent system that can study the opinions of the people and help to classify those related emotions.In this research work,sentiment analysis is performed on the AWS Elastic Compute Cloud(EC2)through Twitter data.The data is managed to the EC2 by using elastic load balancing.The collected data is subjected to preprocessing approaches to clean the data,and then machine learning-based logistic regression is employed to categorize the sentiments into positive and negative sentiments.High accuracy of 94.17%is obtained through the proposed machine learning model which is higher than the other models that are developed using the existing algorithms.
文摘Internet of Things(IoT)is a popular social network in which devices are virtually connected for communicating and sharing information.This is applied greatly in business enterprises and government sectors for delivering the services to their customers,clients and citizens.But,the interaction is success-ful only based on the trust that each device has on another.Thus trust is very much essential for a social network.As Internet of Things have access over sen-sitive information,it urges to many threats that lead data management to risk.This issue is addressed by trust management that help to take decision about trust-worthiness of requestor and provider before communication and sharing.Several trust-based systems are existing for different domain using Dynamic weight meth-od,Fuzzy classification,Bayes inference and very few Regression analysis for IoT.The proposed algorithm is based on Logistic Regression,which provide strong statistical background to trust prediction.To make our stand strong on regression support to trust,we have compared the performance with equivalent sound Bayes analysis using Beta distribution.The performance is studied in simu-lated IoT setup with Quality of Service(QoS)and Social parameters for the nodes.The proposed model performs better in terms of various metrics.An IoT connects heterogeneous devices such as tags and sensor devices for sharing of information and avail different application services.The most salient features of IoT system is to design it with scalability,extendibility,compatibility and resiliency against attack.The existing worksfinds a way to integrate direct and indirect trust to con-verge quickly and estimate the bias due to attacks in addition to the above features.
文摘Predictive Emission Monitoring Systems (PEMS) offer a cost-effective and environmentally friendly alternative to Continuous Emission Monitoring Systems (CEMS) for monitoring pollution from industrial sources. Multiple regression is one of the fundamental statistical techniques to describe the relationship between dependent and independent variables. This model can be effectively used to develop a PEMS, to estimate the amount of pollution emitted by industrial sources, where the fuel composition and other process-related parameters are available. It often makes them sufficient to predict the emission discharge with acceptable accuracy. In cases where PEMS are accepted as an alternative method to CEMS, which use gas analyzers, they can provide cost savings and substantial benefits for ongoing system support and maintenance. The described mathematical concept is based on the matrix algebra representation in multiple regression involving multiple precision arithmetic techniques. Challenging numerical examples for statistical big data analysis, are investigated. Numerical examples illustrate computational accuracy and efficiency of statistical analysis due to increasing the precision level. The programming language C++ is used for mathematical model implementation. The data for research and development, including the dependent fuel and independent NOx emissions data, were obtained from CEMS software installed on a petrochemical plant.
文摘In this paper, firstly, we propose a new method for choosing regularization parameter λ for lasso regression, which differs from traditional method such as multifold cross-validation, our new method gives the maximum value of parameter λ directly. Secondly, by considering another prior form over model space in the Bayes approach, we propose a new extended Bayes information criterion family, and under some mild condition, our new EBIC (NEBIC) is shown to be consistent. Then we apply our new method to choose parameter for sequential lasso regression which selects features by sequentially solving partially penalized least squares problems where the features selected in earlier steps are not penalized in the subsequent steps. Then sequential lasso uses NEBIC as the stopping rule. Finally, we apply our algorithm to identify the nonzero entries of precision matrix for high-dimensional linear discrimination analysis. Simulation results demonstrate that our algorithm has a lower misclassification rate and less computation time than its competing methods under considerations.
基金supported by the Foundation Strengthening Program Technology Field Foundation(2020-JCJQ-JJ-132)。
文摘The interception probability of a single missile is the basis for combat plan design and weapon performance evaluation,while its influencing factors are complex and mutually coupled.Existing calculation methods have very limited analysis of the influence mechanism of influencing factors,and none of them has analyzed the influence of the guidance law.This paper considers the influencing factors of both the interceptor and the target more comprehensively.Interceptor parameters include speed,guidance law,guidance error,fuze error,and fragment killing ability,while target performance includes speed,maneuverability,and vulnerability.In this paper,an interception model is established,Monte Carlo simulation is carried out,and the influence mechanism of each factor is analyzed based on the model and simulation results.Finally,this paper proposes a classification-regression neural network to quickly estimate the interception probability based on the value of influencing factors.The proposed method reduces the interference of invalid interception data to valid data,so its prediction accuracy is significantly better than that of pure regression neural networks.
文摘This paper presents a case study on the IPUMS NHIS database,which provides data from censuses and surveys on the health of the U.S.population,including data related to COVID-19.By addressing gaps in previous studies,we propose a machine learning approach to train predictive models for identifying and measuring factors that affect the severity of COVID-19 symptoms.Our experiments focus on four groups of factors:demographic,socio-economic,health condition,and related to COVID-19 vaccination.By analysing the sensitivity of the variables used to train the models and the VEC(variable effect characteristics)analysis on the variable values,we identify and measure importance of various factors that influence the severity of COVID-19 symptoms.
基金GDAS'Project of Science and Technology Development(2020GDASYL-20200302005)Science and Technology Planning Project of Zhanjiang City(2019A01030)Guangdong Provincial Team of Technical System Innovation for Sugarcane Sisal Hemp Industry(2019KJ104-15).
文摘[Objectives]The purpose of this study was to provide reference for cultivation and promotion of a new sugarcane variety Yuetang 03-373,on the basis of analyzing and summarizing the characters of the variety.[Methods]Correlation,multiple regression and path analyses were performed for the yield and yield components of Yuetang 03-373.[Results]Correlation analysis shows that cane yield was significantly correlated with millable stalk number,stalk length and stalk diameter,and among them,the correlation with millable stalk number was the strongest.Multiple regression and path analyses show that millable stalk number contributed the most to cane yield,followed by stalk length,and stalk diameter contributed the least.The regression equation of cane yield against the three yield components was y=-2.8713+1.5497x1+5.8990x2-395.4294x3(R=0.9672**).[Conclusions]Millable stalk number and stalk length were the important and major factors for high yield of Yuetang 03-373,indicating that Yuetang 03-373 is a sugarcane variety of millable stalk type.In cultivation,full play should be given to the advantage of Yuetang 03-373 in millable stalk number,as well as stalk length(plant height),in order to achieve the purpose of increasing yield.
基金financially supported by the Western Transport Technical Project of the Ministry of Transport, China (No. 2009318000046)
文摘A multivariable regression analysis of the in-situ stress field, which considers the non-linear deformation behavior of faults in practical projects, is presented based on a newly developed three-dimensional displacement discontinuity method (DDM) program. The Bar- ton-Bandis model and the Kulhaway model are adopted as the normal and the tangential deformation model of faults, respectively, where the Mohr-Coulomb failure criterion is satisfied. In practical projects, the values of the mechanical parameters of rock and faults are restricted in a bounded range for in-situ test, and the optimal mechanical parameters are obtained from this range by a loop. Comparing with the traditional finite element method (FEM), the DDM regression results are more accurate.
文摘This paper presents an analysis to forecast the loads of an isolated area where the history of load is not available or the history may not represent the realistic demand of electricity. The analysis is done through linear regression and based on the identification of factors on which electrical load growth depends. To determine the identification factors, areas are selected whose histories of load growth rate known and the load growth deciding factors are similar to those of the isolated area. The proposed analysis is applied to an isolated area of Bangladesh, called Swandip where a past history of electrical load demand is not available and also there is no possibility of connecting the area with the main land grid system.
文摘The blast-induced ground vibration prediction using scaled distance regression analysis is one of the most popular methods employed by engineers for many decades. It uses the maximum charge per delay and distance of monitoring as the major factors for predicting the peak particle velocity(PPV). It is established that the PPV is caused by the maximum charge per delay which varies with the distance of monitoring and site geology. While conducting a production blasting, the waves induced by blasting of different holes interfere destructively with each other, which may result in higher PPV than the predicted value with scaled distance regression analysis. This phenomenon of interference/superimposition of waves is not considered while using scaled distance regression analysis. In this paper, an attempt has been made to compare the predicted values of blast-induced ground vibration using multi-hole trial blasting with single-hole blasting in an opencast coal mine under the same geological condition. Further,the modified prediction equation for the multi-hole trial blasting was obtained using single-hole regression analysis. The error between predicted and actual values of multi-hole blast-induced ground vibration was found to be reduced by 8.5%.