In this paper, a logistical regression statistical analysis (LR) is presented for a set of variables used in experimental measurements in reversed field pinch (RFP) machines, commonly known as “slinky mode” (SM), ob...In this paper, a logistical regression statistical analysis (LR) is presented for a set of variables used in experimental measurements in reversed field pinch (RFP) machines, commonly known as “slinky mode” (SM), observed to travel around the torus in Madison Symmetric Torus (MST). The LR analysis is used to utilize the modified Sine-Gordon dynamic equation model to predict with high confidence whether the slinky mode will lock or not lock when compared to the experimentally measured motion of the slinky mode. It is observed that under certain conditions, the slinky mode “locks” at or near the intersection of poloidal and/or toroidal gaps in MST. However, locked mode cease to travel around the torus;while unlocked mode keeps traveling without a change in the energy, making it hard to determine an exact set of conditions to predict locking/unlocking behaviour. The significant key model parameters determined by LR analysis are shown to improve the Sine-Gordon model’s ability to determine the locking/unlocking of magnetohydrodyamic (MHD) modes. The LR analysis of measured variables provides high confidence in anticipating locking versus unlocking of slinky mode proven by relational comparisons between simulations and the experimentally measured motion of the slinky mode in MST.展开更多
The burning of crop residues in fields is a significant global biomass burning activity which is a key element of the terrestrial carbon cycle,and an important source of atmospheric trace gasses and aerosols.Accurate ...The burning of crop residues in fields is a significant global biomass burning activity which is a key element of the terrestrial carbon cycle,and an important source of atmospheric trace gasses and aerosols.Accurate estimation of cropland burned area is both crucial and challenging,especially for the small and fragmented burned scars in China.Here we developed an automated burned area mapping algorithm that was implemented using Sentinel-2 Multi Spectral Instrument(MSI)data and its effectiveness was tested taking Songnen Plain,Northeast China as a case using satellite image of 2020.We employed a logistic regression method for integrating multiple spectral data into a synthetic indicator,and compared the results with manually interpreted burned area reference maps and the Moderate-Resolution Imaging Spectroradiometer(MODIS)MCD64A1 burned area product.The overall accuracy of the single variable logistic regression was 77.38%to 86.90%and 73.47%to 97.14%for the 52TCQ and 51TYM cases,respectively.In comparison,the accuracy of the burned area map was improved to 87.14%and 98.33%for the 52TCQ and 51TYM cases,respectively by multiple variable logistic regression of Sentind-2 images.The balance of omission error and commission error was also improved.The integration of multiple spectral data combined with a logistic regression method proves to be effective for burned area detection,offering a highly automated process with an automatic threshold determination mechanism.This method exhibits excellent extensibility and flexibility taking the image tile as the operating unit.It is suitable for burned area detection at a regional scale and can also be implemented with other satellite data.展开更多
Purpose:The purpose of this study is to develop and compare model choice strategies in context of logistic regression.Model choice means the choice of the covariates to be included in the model.Design/methodology/appr...Purpose:The purpose of this study is to develop and compare model choice strategies in context of logistic regression.Model choice means the choice of the covariates to be included in the model.Design/methodology/approach:The study is based on Monte Carlo simulations.The methods are compared in terms of three measures of accuracy:specificity and two kinds of sensitivity.A loss function combining sensitivity and specificity is introduced and used for a final comparison.Findings:The choice of method depends on how much the users emphasize sensitivity against specificity.It also depends on the sample size.For a typical logistic regression setting with a moderate sample size and a small to moderate effect size,either BIC,BICc or Lasso seems to be optimal.Research limitations:Numerical simulations cannot cover the whole range of data-generating processes occurring with real-world data.Thus,more simulations are needed.Practical implications:Researchers can refer to these results if they believe that their data-generating process is somewhat similar to some of the scenarios presented in this paper.Alternatively,they could run their own simulations and calculate the loss function.Originality/value:This is a systematic comparison of model choice algorithms and heuristics in context of logistic regression.The distinction between two types of sensitivity and a comparison based on a loss function are methodological novelties.展开更多
In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluste...In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.展开更多
This research introduces a novel approach to improve and optimize the predictive capacity of consumer purchase behaviors on e-commerce platforms. This study presented an introduction to the fundamental concepts of the...This research introduces a novel approach to improve and optimize the predictive capacity of consumer purchase behaviors on e-commerce platforms. This study presented an introduction to the fundamental concepts of the logistic regression algorithm. In addition, it analyzed user data obtained from an e-commerce platform. The original data were preprocessed, and a consumer purchase prediction model was developed for the e-commerce platform using the logistic regression method. The comparison study used the classic random forest approach, further enhanced by including the K-fold cross-validation method. Evaluation of the accuracy of the model’s classification was conducted using performance indicators that included the accuracy rate, the precision rate, the recall rate, and the F1 score. A visual examination determined the significance of the findings. The findings suggest that employing the logistic regression algorithm to forecast customer purchase behaviors on e-commerce platforms can improve the efficacy of the approach and yield more accurate predictions. This study serves as a valuable resource for improving the precision of forecasting customers’ purchase behaviors on e-commerce platforms. It has significant practical implications for optimizing the operational efficiency of e-commerce platforms.展开更多
Internet of Things(IoT)is a popular social network in which devices are virtually connected for communicating and sharing information.This is applied greatly in business enterprises and government sectors for deliveri...Internet of Things(IoT)is a popular social network in which devices are virtually connected for communicating and sharing information.This is applied greatly in business enterprises and government sectors for delivering the services to their customers,clients and citizens.But,the interaction is success-ful only based on the trust that each device has on another.Thus trust is very much essential for a social network.As Internet of Things have access over sen-sitive information,it urges to many threats that lead data management to risk.This issue is addressed by trust management that help to take decision about trust-worthiness of requestor and provider before communication and sharing.Several trust-based systems are existing for different domain using Dynamic weight meth-od,Fuzzy classification,Bayes inference and very few Regression analysis for IoT.The proposed algorithm is based on Logistic Regression,which provide strong statistical background to trust prediction.To make our stand strong on regression support to trust,we have compared the performance with equivalent sound Bayes analysis using Beta distribution.The performance is studied in simu-lated IoT setup with Quality of Service(QoS)and Social parameters for the nodes.The proposed model performs better in terms of various metrics.An IoT connects heterogeneous devices such as tags and sensor devices for sharing of information and avail different application services.The most salient features of IoT system is to design it with scalability,extendibility,compatibility and resiliency against attack.The existing worksfinds a way to integrate direct and indirect trust to con-verge quickly and estimate the bias due to attacks in addition to the above features.展开更多
Autism spectrum disorder(ASD),classified as a developmental disability,is now more common in children than ever.A drastic increase in the rate of autism spectrum disorder in children worldwide demands early detection ...Autism spectrum disorder(ASD),classified as a developmental disability,is now more common in children than ever.A drastic increase in the rate of autism spectrum disorder in children worldwide demands early detection of autism in children.Parents can seek professional help for a better prognosis of the child’s therapy when ASD is diagnosed under five years.This research study aims to develop an automated tool for diagnosing autism in children.The computer-aided diagnosis tool for ASD detection is designed and developed by a novel methodology that includes data acquisition,feature selection,and classification phases.The most deterministic features are selected from the self-acquired dataset by novel feature selection methods before classification.The Imperialistic competitive algorithm(ICA)based on empires conquering colonies performs feature selection in this study.The performance of Logistic Regression(LR),Decision tree,K-Nearest Neighbor(KNN),and Random Forest(RF)classifiers are experimentally studied in this research work.The experimental results prove that the Logistic regression classifier exhibits the highest accuracy for the self-acquired dataset.The ASD detection is evaluated experimentally with the Least Absolute Shrinkage and Selection Operator(LASSO)feature selection method and different classifiers.The Exploratory Data Analysis(EDA)phase has uncovered crucial facts about the data,like the correlation of the features in the dataset with the class variable.展开更多
This paper focuses on ozone prediction in the atmosphere using a machine learning approach. We utilize air pollutant and meteorological variable datasets from the El Paso area to classify ozone levels as high or low. ...This paper focuses on ozone prediction in the atmosphere using a machine learning approach. We utilize air pollutant and meteorological variable datasets from the El Paso area to classify ozone levels as high or low. The LR and ANN algorithms are employed to train the datasets. The models demonstrate a remarkably high classification accuracy of 89.3% in predicting ozone levels on a given day. Evaluation metrics reveal that both the ANN and LR models exhibit accuracies of 89.3% and 88.4%, respectively. Additionally, the AUC values for both models are comparable, with the ANN achieving 95.4% and the LR obtaining 95.2%. The lower the cross-entropy loss (log loss), the higher the model’s accuracy or performance. Our ANN model yields a log loss of 3.74, while the LR model shows a log loss of 6.03. The prediction time for the ANN model is approximately 0.00 seconds, whereas the LR model takes 0.02 seconds. Our odds ratio analysis indicates that features such as “Solar radiation”, “Std. Dev. Wind Direction”, “outdoor temperature”, “dew point temperature”, and “PM10” contribute to high ozone levels in El Paso, Texas. Based on metrics such as accuracy, error rate, log loss, and prediction time, the ANN model proves to be faster and more suitable for ozone classification in the El Paso, Texas area.展开更多
In this paper, a weighted maximum likelihood technique (WMLT) for the logistic regression model is presented. This method depended on a weight function that is continuously adaptable using Mahalanobis distances for pr...In this paper, a weighted maximum likelihood technique (WMLT) for the logistic regression model is presented. This method depended on a weight function that is continuously adaptable using Mahalanobis distances for predictor variables. Under the model, the asymptotic consistency of the suggested estimator is demonstrated and properties of finite-sample are also investigated via simulation. In simulation studies and real data sets, it is observed that the newly proposed technique demonstrated the greatest performance among all estimators compared.展开更多
This paper presents a case study on the IPUMS NHIS database,which provides data from censuses and surveys on the health of the U.S.population,including data related to COVID-19.By addressing gaps in previous studies,w...This paper presents a case study on the IPUMS NHIS database,which provides data from censuses and surveys on the health of the U.S.population,including data related to COVID-19.By addressing gaps in previous studies,we propose a machine learning approach to train predictive models for identifying and measuring factors that affect the severity of COVID-19 symptoms.Our experiments focus on four groups of factors:demographic,socio-economic,health condition,and related to COVID-19 vaccination.By analysing the sensitivity of the variables used to train the models and the VEC(variable effect characteristics)analysis on the variable values,we identify and measure importance of various factors that influence the severity of COVID-19 symptoms.展开更多
Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for rep...Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.展开更多
BACKGROUND Orthodontic treatment can easily cause local soft tissue reactions in the oral cavity of patients under mechanical stress,leading to oral mucosal ulcers and affecting their quality of life.At present,only l...BACKGROUND Orthodontic treatment can easily cause local soft tissue reactions in the oral cavity of patients under mechanical stress,leading to oral mucosal ulcers and affecting their quality of life.At present,only limited literature has explored the factors leading to oral ulcers in orthodontic treatment,and these research results are still controversial.AIM To investigate the current status and related factors of oral mucosal ulcers during orthodontic treatment,aiming to provide a valuable reference for preventing this disease in clinical practice.METHODS A total of 587 patients who underwent orthodontic treatment at the Peking University School of Stomatology and Hospital of Stomatology between 2020 and 2022 were selected and allocated to an observation or control group according to the incidence of oral mucosal ulcers during orthodontic therapy.A questionnaire survey was constructed to collect patient data,including basic information,lifestyle and eating habits,treatment details,mental factors,and trace element levels,and a comparative analysis of this data was performed between the two groups.RESULTS A logistic regression model with oral ulcers as the dependent variable was established.The regression results showed that age(≥60 years:odds ratio[OR]:6.820;95%confidence interval[CI]:2.226–20.893),smoking history(smoking:OR:4.434;95%CI:2.527–7.782),toothbrush hardness(hard:OR:2.804;95%CI:1.746–4.505),dietary temperature(hot diet:OR:1.399;95%CI:1.220–1.722),treatment course(>1 year:OR:3.830;95%CI:2.203–6.659),and tooth brushing frequency(>1 time per day:OR:0.228;95%CI:0.138–0.377)were independent factors for oral mucosal ulcers(P<0.05).Furthermore,Zn level(OR:0.945;95%CI:0.927–0.964)was a protective factor against oral ulcers,while the SAS(OR:1.284;95%CI:1.197–1.378)and SDS(OR:1.322;95%CI:1.231–1.419)scores were risk factors.CONCLUSION Age≥60 years,smoking history,hard toothbrush,hot diet,treatment course for>1 year,tooth brushing frequency of≤1 time per day,and mental anxiety are independent risk factors for oral mucosal ulcers.Therefore,these factors should receive clinical attention and be incorporated into the development and optimization of preventive strategies for reducing oral ulcer incidence.展开更多
The primary objective of landslide susceptibility mapping is the prediction of potential landslides in landslide-prone areas. The predictive power of a landslide susceptibility mapping model could be tested in an adja...The primary objective of landslide susceptibility mapping is the prediction of potential landslides in landslide-prone areas. The predictive power of a landslide susceptibility mapping model could be tested in an adjacent area of similar geo- environmental conditions to find out the reliability. Both the 2oo8 Wenchuan Earthquake and the 2o13 Lushan Earthquake occurred in the Longmen Mountain seismic zone, with similar topographical and geological conditions. The two earthquakes are both featured by thrust fault and similar seismic mechanism This paper adopted the susceptibility mapping model of co-seismic landslides triggered by Wenchuan earthquake to predict the spatial distribution of landslides induced by Lushan earthquake. Six influencing parameters were taken into consideration: distance from the seismic fault, slope gradient, lithology, distance from drainage, elevation and Peak Ground Acceleration (PGA). The preliminary results suggested that the zones with high susceptibility of co- seismic landslides were mainly distributed in the mountainous areas of Lushan, Baoxing and Tianquan counties. The co-seismic landslide susceptibility map was completed in two days after the quake and sent to the field investigators to provide guidance for rescue and relief work. The predictive power of the susceptibility map was validated by ROC curve analysis method using 2o37 co-seismic landslides in the epicenter area. The AUC value of o.71o indicated that the susceptibility model derived from Wenchuan Earthquake landslides showed good accuracy inpredicting the landslides triggered by Lushan earthquake.展开更多
Machine learning is currently one of the research hotspots in the field of landslide prediction.To clarify and evaluate the differences in characteristics and prediction effects of different machine learning models,Co...Machine learning is currently one of the research hotspots in the field of landslide prediction.To clarify and evaluate the differences in characteristics and prediction effects of different machine learning models,Conghua District,which is the most prone to landslide disasters in Guangzhou,was selected for landslide susceptibility evaluation.The evaluation factors were selected by using correlation analysis and variance expansion factor method.Applying four machine learning methods namely Logistic Regression(LR),Random Forest(RF),Support Vector Machines(SVM),and Extreme Gradient Boosting(XGB),landslide models were constructed.Comparative analysis and evaluation of the model were conducted through statistical indices and receiver operating characteristic(ROC)curves.The results showed that LR,RF,SVM,and XGB models have good predictive performance for landslide susceptibility,with the area under curve(AUC)values of 0.752,0.965,0.996,and 0.998,respectively.XGB model had the highest predictive ability,followed by RF model,SVM model,and LR model.The frequency ratio(FR)accuracy of LR,RF,SVM,and XGB models was 0.775,0.842,0.759,and 0.822,respectively.RF and XGB models were superior to LR and SVM models,indicating that the integrated algorithm has better predictive ability than a single classification algorithm in regional landslide classification problems.展开更多
Systematically determining the discriminatory power of various rainfall properties and their combinations in identifying debris flow occurrence is crucial for early warning systems.In this study,we evaluated the discr...Systematically determining the discriminatory power of various rainfall properties and their combinations in identifying debris flow occurrence is crucial for early warning systems.In this study,we evaluated the discriminatory power of different univariate and multivariate rainfall threshold models in identifying triggering conditions of debris flow in the Jiangjia Gully,Yunnan Province,China.The univariate models used single rainfall properties as indicators,including total rainfall(R_(tot)),rainfall duration(D),mean intensity(I_(mean)),absolute energy(Eabs),storm kinetic energy(E_(s)),antecedent rainfall(R_(a)),and maximum rainfall intensity over various durations(I_(max_dur)).The evaluation reveals that the I_(max_dur)and Eabs models have the best performance,followed by the E_(s),R_(tot),and I_(mean)models,while the D and R_(a)models have poor performances.Specifically,the I_(max_dur)model has the highest performance metrics at a 40-min duration.We used logistic regression to combine at least two rainfall properties to establish multivariate threshold models.The results show that adding D or R_(a)to the models dominated by Eabs,E_(s),R_(tot),or I_(mean)generally improve their performances,specifically when D is combined with I_(mean)or when R_(a)is combined with Eabs or E_(s).Including R_(a)in the I_(max_dur)model,it performs better than the univariate I_(max_dur)model.A power-law relationship between I_(max_dur)and R_(a)or between Eabs and R_(a)has better performance than the traditional I_(mean)–D model,while the performance of the E_(s)–R_(a)model is moderate.Our evaluation reemphasizes the important role of the maximum intensity over short durations in debris flow occurrence.It also highlights the importance of systematically investigating the role of R_(a)in establishing rainfall thresholds for triggering debris flow.Given the regional variations in rainfall patterns worldwide,it is necessary to evaluate the findings of this study across diverse watersheds.展开更多
Arc grounding faults occur frequently in the power grid with small resistance grounding neutral points.The existing arc fault identification technology only uses the fault line signal characteristics to set the identi...Arc grounding faults occur frequently in the power grid with small resistance grounding neutral points.The existing arc fault identification technology only uses the fault line signal characteristics to set the identification index,which leads to detection failure when the arc zero-off characteristic is short.To solve this problem,this paper presents an arc fault identification method by utilizing integrated signal characteristics of both the fault line and sound lines.Firstly,the waveform characteristics of the fault line and sound lines under an arc grounding fault are studied.After that,the convex hull,gradient product,and correlation coefficient index are used as the basic characteristic parameters to establish fault identification criteria.Then,the logistic regression algorithm is employed to deal with the reference samples,establish the machine discrimination model,and realize the discrimination of fault types.Finally,simulation test results and experimental results verify the accuracy of the proposed method.The comparison analysis shows that the proposed method has higher recognition accuracy,especially when the arc dissipation power is smaller than 2×10^(3) W,the zero-off period is not obvious.In conclusion,the proposed method expands the arc fault identification theory.展开更多
BACKGROUND Pediatric appendicitis is a common cause of abdominal pain in children and is recognized as a significant surgical emergency.A prompt and accurate diagnosis is essential to prevent complications such as per...BACKGROUND Pediatric appendicitis is a common cause of abdominal pain in children and is recognized as a significant surgical emergency.A prompt and accurate diagnosis is essential to prevent complications such as perforation and peritonitis.AIM To investigate the predictive value of the systemic immune-inflammation index(SII)combined with the pediatric appendicitis score(PAS)for the assessment of disease severity and surgical outcomes in children aged 5 years and older with appendicitis.METHODS Clinical data of 104 children diagnosed with acute appendicitis were analyzed.The participants were categorized into the acute appendicitis group and chronic appendicitis group based on disease presentation and further stratified into the good prognosis group and poor prognosis group based on prognosis.The SII and PAS were measured,and a joint model using the combined SII and PAS was constructed to predict disease severity and surgical outcomes.RESULTS Significant differences were observed in the SII and PAS parameters between the acute appendicitis group and chronic appendicitis group.Correlation analysis showed associations among the SII,PAS,and disease severity,with the combined SII and PAS model demonstrating significant predictive value for assessing disease severity[aera under the curve(AUC)=0.914]and predicting surgical outcomes(AUC=0.857)in children aged 5 years and older with appendicitis.CONCLUSION The study findings support the potential of integrating the SII with the PAS for assessing disease severity and predicting surgical outcomes in pediatric appendicitis,indicating the clinical utility of the combined SII and PAS model in guiding clinical decision-making and optimizing surgical management strategies for pediatric patients with appendicitis.展开更多
This is an erratum to an already published paper named“Establishment of a prediction model for prehospital return of spontaneous circulation in out-ofhospital patients with cardiac arrest”.We found errors in the aff...This is an erratum to an already published paper named“Establishment of a prediction model for prehospital return of spontaneous circulation in out-ofhospital patients with cardiac arrest”.We found errors in the affiliated institution of the authors.We apologize for our unintentional mistake.Please note,these changes do not affect our results.展开更多
“Human-elephant conflict(HEC)”,the alarming issue,in present day context has attracted the attention of environmentalists and policy makers.The rising conflict between human beings and wild elephants is common in Bu...“Human-elephant conflict(HEC)”,the alarming issue,in present day context has attracted the attention of environmentalists and policy makers.The rising conflict between human beings and wild elephants is common in Buxa Tiger Reserve(BTR)and its adjoining area in West Bengal State,India,making the area volatile.People’s attitudes towards elephant conservation activity are very crucial to get rid of HEC,because people’s proximity with wild elephants’habitat can trigger the occurrence of HEC.The aim of this study is to conduct an in-depth investigation about the association of people’s attitudes towards HEC with their locational,demographic,and socio-economic characteristics in BTR and its adjoining area by using Pearson’s bivariate chi-square test and binary logistic regression analysis.BTR is one of the constituent parts of Eastern Doors Elephant Reserve(EDER).We interviewed 500 respondents to understand their perceptions to HEC and investigated their locational,demographic,and socio-economic characteristics including location of village,gender,age,ethnicity,religion,caste,poverty level,education level,primary occupation,secondary occupation,household type,and source of firewood.The results indicate that respondents who are living in enclave forest villages(EFVs),peripheral forest villages(PFVs),corridor village(CVs),or forest and corridor villages(FCVs),mainly males,at the age of 18–48 years old,engaged with agriculture occupation,and living in kancha and mixed houses,have more likelihood to witness HEC.Besides,respondents who are illiterate or at primary education level are more likely to regard elephant as a main problematic animal around their villages and refuse to participate in elephant conservation activity.For the sake of a sustainable environment for both human beings and wildlife,people’s attitudes towards elephants must be friendly in a more prudent way,so that the two communities can live in harmony.展开更多
BACKGROUND Type 2 diabetes mellitus(T2DM)is a leading risk factor for the development and progression of chronic kidney disease(CKD).However,an accurate and con-venient marker for early detection and appropriate manag...BACKGROUND Type 2 diabetes mellitus(T2DM)is a leading risk factor for the development and progression of chronic kidney disease(CKD).However,an accurate and con-venient marker for early detection and appropriate management of CKD in in-dividuals with T2DM is limited.Recent studies have demonstrated a strong correlation between the neutrophil-to-lymphocyte ratio(NLR)and CKD.None-theless,the predictive value of NLR for renal damage in type 2 diabetic patients remains understudied.This study included 1040 adults aged 65 or older with T2DM from Shanghai's Community Health Service Center.The total number of neutrophils and lym-phocytes was detected,and NLR levels were calculated.CKD was defined as an estimated glomerular filtration rate≤60 mL/min/1.73 m².Participants were di-vided into four groups based on NLR levels.The clinical data and biochemical characteristics were compared among groups.A multivariate logistic regression model was used to analyze the association between NLR levels and CKD.RESULTS Significant differences were found in terms of sex,serum creatinine,blood urea nitrogen,total cholesterol,and low-density lipoprotein cholesterol among patients with T2DM in different NLR groups(P<0.0007).T2DM patients in the highest NLR quartile had a higher prevalence of CKD(P for trend=0.0011).Multivariate logistic regression analysis indicated that a high NLR was an independent risk factor for CKD in T2DM patients even after adjustment for important clinical and pathological parameters(P=0.0001,odds ratio=1.41,95%confidence intervals:1.18-1.68).CONCLUSION An elevated NLR in patients with T2DM is associated with higher prevalence of CKD,suggesting that it could be a marker for the detection and evaluation of diabetic kidney disease.展开更多
文摘In this paper, a logistical regression statistical analysis (LR) is presented for a set of variables used in experimental measurements in reversed field pinch (RFP) machines, commonly known as “slinky mode” (SM), observed to travel around the torus in Madison Symmetric Torus (MST). The LR analysis is used to utilize the modified Sine-Gordon dynamic equation model to predict with high confidence whether the slinky mode will lock or not lock when compared to the experimentally measured motion of the slinky mode. It is observed that under certain conditions, the slinky mode “locks” at or near the intersection of poloidal and/or toroidal gaps in MST. However, locked mode cease to travel around the torus;while unlocked mode keeps traveling without a change in the energy, making it hard to determine an exact set of conditions to predict locking/unlocking behaviour. The significant key model parameters determined by LR analysis are shown to improve the Sine-Gordon model’s ability to determine the locking/unlocking of magnetohydrodyamic (MHD) modes. The LR analysis of measured variables provides high confidence in anticipating locking versus unlocking of slinky mode proven by relational comparisons between simulations and the experimentally measured motion of the slinky mode in MST.
基金Under the auspices of National Natural Science Foundation of China(No.42101414)Natural Science Found for Outstanding Young Scholars in Jilin Province(No.20230508106RC)。
文摘The burning of crop residues in fields is a significant global biomass burning activity which is a key element of the terrestrial carbon cycle,and an important source of atmospheric trace gasses and aerosols.Accurate estimation of cropland burned area is both crucial and challenging,especially for the small and fragmented burned scars in China.Here we developed an automated burned area mapping algorithm that was implemented using Sentinel-2 Multi Spectral Instrument(MSI)data and its effectiveness was tested taking Songnen Plain,Northeast China as a case using satellite image of 2020.We employed a logistic regression method for integrating multiple spectral data into a synthetic indicator,and compared the results with manually interpreted burned area reference maps and the Moderate-Resolution Imaging Spectroradiometer(MODIS)MCD64A1 burned area product.The overall accuracy of the single variable logistic regression was 77.38%to 86.90%and 73.47%to 97.14%for the 52TCQ and 51TYM cases,respectively.In comparison,the accuracy of the burned area map was improved to 87.14%and 98.33%for the 52TCQ and 51TYM cases,respectively by multiple variable logistic regression of Sentind-2 images.The balance of omission error and commission error was also improved.The integration of multiple spectral data combined with a logistic regression method proves to be effective for burned area detection,offering a highly automated process with an automatic threshold determination mechanism.This method exhibits excellent extensibility and flexibility taking the image tile as the operating unit.It is suitable for burned area detection at a regional scale and can also be implemented with other satellite data.
文摘Purpose:The purpose of this study is to develop and compare model choice strategies in context of logistic regression.Model choice means the choice of the covariates to be included in the model.Design/methodology/approach:The study is based on Monte Carlo simulations.The methods are compared in terms of three measures of accuracy:specificity and two kinds of sensitivity.A loss function combining sensitivity and specificity is introduced and used for a final comparison.Findings:The choice of method depends on how much the users emphasize sensitivity against specificity.It also depends on the sample size.For a typical logistic regression setting with a moderate sample size and a small to moderate effect size,either BIC,BICc or Lasso seems to be optimal.Research limitations:Numerical simulations cannot cover the whole range of data-generating processes occurring with real-world data.Thus,more simulations are needed.Practical implications:Researchers can refer to these results if they believe that their data-generating process is somewhat similar to some of the scenarios presented in this paper.Alternatively,they could run their own simulations and calculate the loss function.Originality/value:This is a systematic comparison of model choice algorithms and heuristics in context of logistic regression.The distinction between two types of sensitivity and a comparison based on a loss function are methodological novelties.
文摘In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.
文摘This research introduces a novel approach to improve and optimize the predictive capacity of consumer purchase behaviors on e-commerce platforms. This study presented an introduction to the fundamental concepts of the logistic regression algorithm. In addition, it analyzed user data obtained from an e-commerce platform. The original data were preprocessed, and a consumer purchase prediction model was developed for the e-commerce platform using the logistic regression method. The comparison study used the classic random forest approach, further enhanced by including the K-fold cross-validation method. Evaluation of the accuracy of the model’s classification was conducted using performance indicators that included the accuracy rate, the precision rate, the recall rate, and the F1 score. A visual examination determined the significance of the findings. The findings suggest that employing the logistic regression algorithm to forecast customer purchase behaviors on e-commerce platforms can improve the efficacy of the approach and yield more accurate predictions. This study serves as a valuable resource for improving the precision of forecasting customers’ purchase behaviors on e-commerce platforms. It has significant practical implications for optimizing the operational efficiency of e-commerce platforms.
文摘Internet of Things(IoT)is a popular social network in which devices are virtually connected for communicating and sharing information.This is applied greatly in business enterprises and government sectors for delivering the services to their customers,clients and citizens.But,the interaction is success-ful only based on the trust that each device has on another.Thus trust is very much essential for a social network.As Internet of Things have access over sen-sitive information,it urges to many threats that lead data management to risk.This issue is addressed by trust management that help to take decision about trust-worthiness of requestor and provider before communication and sharing.Several trust-based systems are existing for different domain using Dynamic weight meth-od,Fuzzy classification,Bayes inference and very few Regression analysis for IoT.The proposed algorithm is based on Logistic Regression,which provide strong statistical background to trust prediction.To make our stand strong on regression support to trust,we have compared the performance with equivalent sound Bayes analysis using Beta distribution.The performance is studied in simu-lated IoT setup with Quality of Service(QoS)and Social parameters for the nodes.The proposed model performs better in terms of various metrics.An IoT connects heterogeneous devices such as tags and sensor devices for sharing of information and avail different application services.The most salient features of IoT system is to design it with scalability,extendibility,compatibility and resiliency against attack.The existing worksfinds a way to integrate direct and indirect trust to con-verge quickly and estimate the bias due to attacks in addition to the above features.
基金The authors extend their appreciation to the Deputyship for Research&Innovation,Ministry of Education in Saudi Arabia for funding this research work through the Project Number(IF2-PSAU-2022/01/22043)。
文摘Autism spectrum disorder(ASD),classified as a developmental disability,is now more common in children than ever.A drastic increase in the rate of autism spectrum disorder in children worldwide demands early detection of autism in children.Parents can seek professional help for a better prognosis of the child’s therapy when ASD is diagnosed under five years.This research study aims to develop an automated tool for diagnosing autism in children.The computer-aided diagnosis tool for ASD detection is designed and developed by a novel methodology that includes data acquisition,feature selection,and classification phases.The most deterministic features are selected from the self-acquired dataset by novel feature selection methods before classification.The Imperialistic competitive algorithm(ICA)based on empires conquering colonies performs feature selection in this study.The performance of Logistic Regression(LR),Decision tree,K-Nearest Neighbor(KNN),and Random Forest(RF)classifiers are experimentally studied in this research work.The experimental results prove that the Logistic regression classifier exhibits the highest accuracy for the self-acquired dataset.The ASD detection is evaluated experimentally with the Least Absolute Shrinkage and Selection Operator(LASSO)feature selection method and different classifiers.The Exploratory Data Analysis(EDA)phase has uncovered crucial facts about the data,like the correlation of the features in the dataset with the class variable.
文摘This paper focuses on ozone prediction in the atmosphere using a machine learning approach. We utilize air pollutant and meteorological variable datasets from the El Paso area to classify ozone levels as high or low. The LR and ANN algorithms are employed to train the datasets. The models demonstrate a remarkably high classification accuracy of 89.3% in predicting ozone levels on a given day. Evaluation metrics reveal that both the ANN and LR models exhibit accuracies of 89.3% and 88.4%, respectively. Additionally, the AUC values for both models are comparable, with the ANN achieving 95.4% and the LR obtaining 95.2%. The lower the cross-entropy loss (log loss), the higher the model’s accuracy or performance. Our ANN model yields a log loss of 3.74, while the LR model shows a log loss of 6.03. The prediction time for the ANN model is approximately 0.00 seconds, whereas the LR model takes 0.02 seconds. Our odds ratio analysis indicates that features such as “Solar radiation”, “Std. Dev. Wind Direction”, “outdoor temperature”, “dew point temperature”, and “PM10” contribute to high ozone levels in El Paso, Texas. Based on metrics such as accuracy, error rate, log loss, and prediction time, the ANN model proves to be faster and more suitable for ozone classification in the El Paso, Texas area.
文摘In this paper, a weighted maximum likelihood technique (WMLT) for the logistic regression model is presented. This method depended on a weight function that is continuously adaptable using Mahalanobis distances for predictor variables. Under the model, the asymptotic consistency of the suggested estimator is demonstrated and properties of finite-sample are also investigated via simulation. In simulation studies and real data sets, it is observed that the newly proposed technique demonstrated the greatest performance among all estimators compared.
文摘This paper presents a case study on the IPUMS NHIS database,which provides data from censuses and surveys on the health of the U.S.population,including data related to COVID-19.By addressing gaps in previous studies,we propose a machine learning approach to train predictive models for identifying and measuring factors that affect the severity of COVID-19 symptoms.Our experiments focus on four groups of factors:demographic,socio-economic,health condition,and related to COVID-19 vaccination.By analysing the sensitivity of the variables used to train the models and the VEC(variable effect characteristics)analysis on the variable values,we identify and measure importance of various factors that influence the severity of COVID-19 symptoms.
文摘Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.
文摘BACKGROUND Orthodontic treatment can easily cause local soft tissue reactions in the oral cavity of patients under mechanical stress,leading to oral mucosal ulcers and affecting their quality of life.At present,only limited literature has explored the factors leading to oral ulcers in orthodontic treatment,and these research results are still controversial.AIM To investigate the current status and related factors of oral mucosal ulcers during orthodontic treatment,aiming to provide a valuable reference for preventing this disease in clinical practice.METHODS A total of 587 patients who underwent orthodontic treatment at the Peking University School of Stomatology and Hospital of Stomatology between 2020 and 2022 were selected and allocated to an observation or control group according to the incidence of oral mucosal ulcers during orthodontic therapy.A questionnaire survey was constructed to collect patient data,including basic information,lifestyle and eating habits,treatment details,mental factors,and trace element levels,and a comparative analysis of this data was performed between the two groups.RESULTS A logistic regression model with oral ulcers as the dependent variable was established.The regression results showed that age(≥60 years:odds ratio[OR]:6.820;95%confidence interval[CI]:2.226–20.893),smoking history(smoking:OR:4.434;95%CI:2.527–7.782),toothbrush hardness(hard:OR:2.804;95%CI:1.746–4.505),dietary temperature(hot diet:OR:1.399;95%CI:1.220–1.722),treatment course(>1 year:OR:3.830;95%CI:2.203–6.659),and tooth brushing frequency(>1 time per day:OR:0.228;95%CI:0.138–0.377)were independent factors for oral mucosal ulcers(P<0.05).Furthermore,Zn level(OR:0.945;95%CI:0.927–0.964)was a protective factor against oral ulcers,while the SAS(OR:1.284;95%CI:1.197–1.378)and SDS(OR:1.322;95%CI:1.231–1.419)scores were risk factors.CONCLUSION Age≥60 years,smoking history,hard toothbrush,hot diet,treatment course for>1 year,tooth brushing frequency of≤1 time per day,and mental anxiety are independent risk factors for oral mucosal ulcers.Therefore,these factors should receive clinical attention and be incorporated into the development and optimization of preventive strategies for reducing oral ulcer incidence.
基金supported by the National Basic Research Program"973"Project of the Ministry of Science and Technology of the People’s Republic of China(GrantNo.2013CB733202)theNational Key Technology R&D Program(Grant No.2011BAK12B01)+1 种基金the Young Foundation of NationalNatural Science of China(Grant No.41202210)the National Science Fund for DistinguishedYoung Scholars(Grant No.41225011)
文摘The primary objective of landslide susceptibility mapping is the prediction of potential landslides in landslide-prone areas. The predictive power of a landslide susceptibility mapping model could be tested in an adjacent area of similar geo- environmental conditions to find out the reliability. Both the 2oo8 Wenchuan Earthquake and the 2o13 Lushan Earthquake occurred in the Longmen Mountain seismic zone, with similar topographical and geological conditions. The two earthquakes are both featured by thrust fault and similar seismic mechanism This paper adopted the susceptibility mapping model of co-seismic landslides triggered by Wenchuan earthquake to predict the spatial distribution of landslides induced by Lushan earthquake. Six influencing parameters were taken into consideration: distance from the seismic fault, slope gradient, lithology, distance from drainage, elevation and Peak Ground Acceleration (PGA). The preliminary results suggested that the zones with high susceptibility of co- seismic landslides were mainly distributed in the mountainous areas of Lushan, Baoxing and Tianquan counties. The co-seismic landslide susceptibility map was completed in two days after the quake and sent to the field investigators to provide guidance for rescue and relief work. The predictive power of the susceptibility map was validated by ROC curve analysis method using 2o37 co-seismic landslides in the epicenter area. The AUC value of o.71o indicated that the susceptibility model derived from Wenchuan Earthquake landslides showed good accuracy inpredicting the landslides triggered by Lushan earthquake.
基金supported by the projects of the China Geological Survey(DD20221729,DD20190291)Zhuhai Urban Geological Survey(including informatization)(MZCD–2201–008).
文摘Machine learning is currently one of the research hotspots in the field of landslide prediction.To clarify and evaluate the differences in characteristics and prediction effects of different machine learning models,Conghua District,which is the most prone to landslide disasters in Guangzhou,was selected for landslide susceptibility evaluation.The evaluation factors were selected by using correlation analysis and variance expansion factor method.Applying four machine learning methods namely Logistic Regression(LR),Random Forest(RF),Support Vector Machines(SVM),and Extreme Gradient Boosting(XGB),landslide models were constructed.Comparative analysis and evaluation of the model were conducted through statistical indices and receiver operating characteristic(ROC)curves.The results showed that LR,RF,SVM,and XGB models have good predictive performance for landslide susceptibility,with the area under curve(AUC)values of 0.752,0.965,0.996,and 0.998,respectively.XGB model had the highest predictive ability,followed by RF model,SVM model,and LR model.The frequency ratio(FR)accuracy of LR,RF,SVM,and XGB models was 0.775,0.842,0.759,and 0.822,respectively.RF and XGB models were superior to LR and SVM models,indicating that the integrated algorithm has better predictive ability than a single classification algorithm in regional landslide classification problems.
基金supported by the National Key R&D Program of China(No.2023YFC3007205)the National Natural Science Foundation of China(Nos.42271013,42077440)Project of the Department of Science and Technology of Sichuan Province(No.2023ZHCG0012).
文摘Systematically determining the discriminatory power of various rainfall properties and their combinations in identifying debris flow occurrence is crucial for early warning systems.In this study,we evaluated the discriminatory power of different univariate and multivariate rainfall threshold models in identifying triggering conditions of debris flow in the Jiangjia Gully,Yunnan Province,China.The univariate models used single rainfall properties as indicators,including total rainfall(R_(tot)),rainfall duration(D),mean intensity(I_(mean)),absolute energy(Eabs),storm kinetic energy(E_(s)),antecedent rainfall(R_(a)),and maximum rainfall intensity over various durations(I_(max_dur)).The evaluation reveals that the I_(max_dur)and Eabs models have the best performance,followed by the E_(s),R_(tot),and I_(mean)models,while the D and R_(a)models have poor performances.Specifically,the I_(max_dur)model has the highest performance metrics at a 40-min duration.We used logistic regression to combine at least two rainfall properties to establish multivariate threshold models.The results show that adding D or R_(a)to the models dominated by Eabs,E_(s),R_(tot),or I_(mean)generally improve their performances,specifically when D is combined with I_(mean)or when R_(a)is combined with Eabs or E_(s).Including R_(a)in the I_(max_dur)model,it performs better than the univariate I_(max_dur)model.A power-law relationship between I_(max_dur)and R_(a)or between Eabs and R_(a)has better performance than the traditional I_(mean)–D model,while the performance of the E_(s)–R_(a)model is moderate.Our evaluation reemphasizes the important role of the maximum intensity over short durations in debris flow occurrence.It also highlights the importance of systematically investigating the role of R_(a)in establishing rainfall thresholds for triggering debris flow.Given the regional variations in rainfall patterns worldwide,it is necessary to evaluate the findings of this study across diverse watersheds.
基金This work was supported in part by the Natural Science Foundation of Henan Province,and the specific grant number is 232300420301。
文摘Arc grounding faults occur frequently in the power grid with small resistance grounding neutral points.The existing arc fault identification technology only uses the fault line signal characteristics to set the identification index,which leads to detection failure when the arc zero-off characteristic is short.To solve this problem,this paper presents an arc fault identification method by utilizing integrated signal characteristics of both the fault line and sound lines.Firstly,the waveform characteristics of the fault line and sound lines under an arc grounding fault are studied.After that,the convex hull,gradient product,and correlation coefficient index are used as the basic characteristic parameters to establish fault identification criteria.Then,the logistic regression algorithm is employed to deal with the reference samples,establish the machine discrimination model,and realize the discrimination of fault types.Finally,simulation test results and experimental results verify the accuracy of the proposed method.The comparison analysis shows that the proposed method has higher recognition accuracy,especially when the arc dissipation power is smaller than 2×10^(3) W,the zero-off period is not obvious.In conclusion,the proposed method expands the arc fault identification theory.
文摘BACKGROUND Pediatric appendicitis is a common cause of abdominal pain in children and is recognized as a significant surgical emergency.A prompt and accurate diagnosis is essential to prevent complications such as perforation and peritonitis.AIM To investigate the predictive value of the systemic immune-inflammation index(SII)combined with the pediatric appendicitis score(PAS)for the assessment of disease severity and surgical outcomes in children aged 5 years and older with appendicitis.METHODS Clinical data of 104 children diagnosed with acute appendicitis were analyzed.The participants were categorized into the acute appendicitis group and chronic appendicitis group based on disease presentation and further stratified into the good prognosis group and poor prognosis group based on prognosis.The SII and PAS were measured,and a joint model using the combined SII and PAS was constructed to predict disease severity and surgical outcomes.RESULTS Significant differences were observed in the SII and PAS parameters between the acute appendicitis group and chronic appendicitis group.Correlation analysis showed associations among the SII,PAS,and disease severity,with the combined SII and PAS model demonstrating significant predictive value for assessing disease severity[aera under the curve(AUC)=0.914]and predicting surgical outcomes(AUC=0.857)in children aged 5 years and older with appendicitis.CONCLUSION The study findings support the potential of integrating the SII with the PAS for assessing disease severity and predicting surgical outcomes in pediatric appendicitis,indicating the clinical utility of the combined SII and PAS model in guiding clinical decision-making and optimizing surgical management strategies for pediatric patients with appendicitis.
文摘This is an erratum to an already published paper named“Establishment of a prediction model for prehospital return of spontaneous circulation in out-ofhospital patients with cardiac arrest”.We found errors in the affiliated institution of the authors.We apologize for our unintentional mistake.Please note,these changes do not affect our results.
文摘“Human-elephant conflict(HEC)”,the alarming issue,in present day context has attracted the attention of environmentalists and policy makers.The rising conflict between human beings and wild elephants is common in Buxa Tiger Reserve(BTR)and its adjoining area in West Bengal State,India,making the area volatile.People’s attitudes towards elephant conservation activity are very crucial to get rid of HEC,because people’s proximity with wild elephants’habitat can trigger the occurrence of HEC.The aim of this study is to conduct an in-depth investigation about the association of people’s attitudes towards HEC with their locational,demographic,and socio-economic characteristics in BTR and its adjoining area by using Pearson’s bivariate chi-square test and binary logistic regression analysis.BTR is one of the constituent parts of Eastern Doors Elephant Reserve(EDER).We interviewed 500 respondents to understand their perceptions to HEC and investigated their locational,demographic,and socio-economic characteristics including location of village,gender,age,ethnicity,religion,caste,poverty level,education level,primary occupation,secondary occupation,household type,and source of firewood.The results indicate that respondents who are living in enclave forest villages(EFVs),peripheral forest villages(PFVs),corridor village(CVs),or forest and corridor villages(FCVs),mainly males,at the age of 18–48 years old,engaged with agriculture occupation,and living in kancha and mixed houses,have more likelihood to witness HEC.Besides,respondents who are illiterate or at primary education level are more likely to regard elephant as a main problematic animal around their villages and refuse to participate in elephant conservation activity.For the sake of a sustainable environment for both human beings and wildlife,people’s attitudes towards elephants must be friendly in a more prudent way,so that the two communities can live in harmony.
基金Supported by Health Commission of Baoshan District,Shanghai,China,No.BSJCPP-A-04 and No.BSZK-2023-T04the Science and Technology Commission of Baoshan District,Shanghai,China,No.20-E-63 and No.21-E-34.
文摘BACKGROUND Type 2 diabetes mellitus(T2DM)is a leading risk factor for the development and progression of chronic kidney disease(CKD).However,an accurate and con-venient marker for early detection and appropriate management of CKD in in-dividuals with T2DM is limited.Recent studies have demonstrated a strong correlation between the neutrophil-to-lymphocyte ratio(NLR)and CKD.None-theless,the predictive value of NLR for renal damage in type 2 diabetic patients remains understudied.This study included 1040 adults aged 65 or older with T2DM from Shanghai's Community Health Service Center.The total number of neutrophils and lym-phocytes was detected,and NLR levels were calculated.CKD was defined as an estimated glomerular filtration rate≤60 mL/min/1.73 m².Participants were di-vided into four groups based on NLR levels.The clinical data and biochemical characteristics were compared among groups.A multivariate logistic regression model was used to analyze the association between NLR levels and CKD.RESULTS Significant differences were found in terms of sex,serum creatinine,blood urea nitrogen,total cholesterol,and low-density lipoprotein cholesterol among patients with T2DM in different NLR groups(P<0.0007).T2DM patients in the highest NLR quartile had a higher prevalence of CKD(P for trend=0.0011).Multivariate logistic regression analysis indicated that a high NLR was an independent risk factor for CKD in T2DM patients even after adjustment for important clinical and pathological parameters(P=0.0001,odds ratio=1.41,95%confidence intervals:1.18-1.68).CONCLUSION An elevated NLR in patients with T2DM is associated with higher prevalence of CKD,suggesting that it could be a marker for the detection and evaluation of diabetic kidney disease.