Landslide susceptibility mapping is the first step in regional hazard management as it helps to understand the spatial distribution of the probability of slope failure in an area.An attempt is made to map the landslid...Landslide susceptibility mapping is the first step in regional hazard management as it helps to understand the spatial distribution of the probability of slope failure in an area.An attempt is made to map the landslide susceptibility in Tevankarai Ar subwatershed,Kodaikkanal,India using binary logistic regression analysis.Geographic Information System is used to prepare the database of the predictor variables and landslide inventory map,which is used to build the spatial model of landslide susceptibility.The model describes the relationship between the dependent variable(presence and absence of landslide) and the independent variables selected for study(predictor variables) by the best fitting function.A forward stepwise logistic regression model using maximum likelihood estimation is used in the regression analysis.An inventory of 84 landslides and cells within a buffer distance of 10m around the landslide is used as the dependent variable.Relief,slope,aspect,plan curvature,profile curvature,land use,soil,topographic wetness index,proximity to roads and proximity to lineaments are taken as independent variables.The constant and the coefficient of the predictor variable retained by the regression model are used to calculate the probability of slope failure and analyze the effect of each predictor variable on landslide occurrence in thestudy area.The model shows that the most significant parameter contributing to landslides is slope.The other significant parameters are profile curvature,soil,road,wetness index and relief.The predictive logistic regression model is validated using temporal validation data-set of known landslide locations and shows an accuracy of 85.29 %.展开更多
Purpose:The purpose of this study is to develop and compare model choice strategies in context of logistic regression.Model choice means the choice of the covariates to be included in the model.Design/methodology/appr...Purpose:The purpose of this study is to develop and compare model choice strategies in context of logistic regression.Model choice means the choice of the covariates to be included in the model.Design/methodology/approach:The study is based on Monte Carlo simulations.The methods are compared in terms of three measures of accuracy:specificity and two kinds of sensitivity.A loss function combining sensitivity and specificity is introduced and used for a final comparison.Findings:The choice of method depends on how much the users emphasize sensitivity against specificity.It also depends on the sample size.For a typical logistic regression setting with a moderate sample size and a small to moderate effect size,either BIC,BICc or Lasso seems to be optimal.Research limitations:Numerical simulations cannot cover the whole range of data-generating processes occurring with real-world data.Thus,more simulations are needed.Practical implications:Researchers can refer to these results if they believe that their data-generating process is somewhat similar to some of the scenarios presented in this paper.Alternatively,they could run their own simulations and calculate the loss function.Originality/value:This is a systematic comparison of model choice algorithms and heuristics in context of logistic regression.The distinction between two types of sensitivity and a comparison based on a loss function are methodological novelties.展开更多
In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluste...In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.展开更多
The burning of crop residues in fields is a significant global biomass burning activity which is a key element of the terrestrial carbon cycle,and an important source of atmospheric trace gasses and aerosols.Accurate ...The burning of crop residues in fields is a significant global biomass burning activity which is a key element of the terrestrial carbon cycle,and an important source of atmospheric trace gasses and aerosols.Accurate estimation of cropland burned area is both crucial and challenging,especially for the small and fragmented burned scars in China.Here we developed an automated burned area mapping algorithm that was implemented using Sentinel-2 Multi Spectral Instrument(MSI)data and its effectiveness was tested taking Songnen Plain,Northeast China as a case using satellite image of 2020.We employed a logistic regression method for integrating multiple spectral data into a synthetic indicator,and compared the results with manually interpreted burned area reference maps and the Moderate-Resolution Imaging Spectroradiometer(MODIS)MCD64A1 burned area product.The overall accuracy of the single variable logistic regression was 77.38%to 86.90%and 73.47%to 97.14%for the 52TCQ and 51TYM cases,respectively.In comparison,the accuracy of the burned area map was improved to 87.14%and 98.33%for the 52TCQ and 51TYM cases,respectively by multiple variable logistic regression of Sentind-2 images.The balance of omission error and commission error was also improved.The integration of multiple spectral data combined with a logistic regression method proves to be effective for burned area detection,offering a highly automated process with an automatic threshold determination mechanism.This method exhibits excellent extensibility and flexibility taking the image tile as the operating unit.It is suitable for burned area detection at a regional scale and can also be implemented with other satellite data.展开更多
In this paper, a logistical regression statistical analysis (LR) is presented for a set of variables used in experimental measurements in reversed field pinch (RFP) machines, commonly known as “slinky mode” (SM), ob...In this paper, a logistical regression statistical analysis (LR) is presented for a set of variables used in experimental measurements in reversed field pinch (RFP) machines, commonly known as “slinky mode” (SM), observed to travel around the torus in Madison Symmetric Torus (MST). The LR analysis is used to utilize the modified Sine-Gordon dynamic equation model to predict with high confidence whether the slinky mode will lock or not lock when compared to the experimentally measured motion of the slinky mode. It is observed that under certain conditions, the slinky mode “locks” at or near the intersection of poloidal and/or toroidal gaps in MST. However, locked mode cease to travel around the torus;while unlocked mode keeps traveling without a change in the energy, making it hard to determine an exact set of conditions to predict locking/unlocking behaviour. The significant key model parameters determined by LR analysis are shown to improve the Sine-Gordon model’s ability to determine the locking/unlocking of magnetohydrodyamic (MHD) modes. The LR analysis of measured variables provides high confidence in anticipating locking versus unlocking of slinky mode proven by relational comparisons between simulations and the experimentally measured motion of the slinky mode in MST.展开更多
On the first anniversary of the implementation of the new regulations of Beijing Municipality on the management of domestic waste,to understand residents’views on the waste classification policy,the project conducted...On the first anniversary of the implementation of the new regulations of Beijing Municipality on the management of domestic waste,to understand residents’views on the waste classification policy,the project conducted relevant investigation of the satisfaction of residents with the domestic waste classification policy in Daxing District of Beijing,China.Based on the analysis of the survey,this study uses the binary logistic regression model to explore the residents’satisfaction with the new domestic waste classification policy in Beijing and its influencing factors.The data from 398 valid questionnaires involve the demographic characteristics of residents,residents’cognition and views on Beijing municipal solid waste classification policy,and residents’satisfaction with Beijing domestic waste classification policy.The data show that the comprehensive satisfaction level of residents with the domestic waste classification policy in Beijing is quite high,up to 84.7%.Among them,the satisfaction level of residents with the details of the classification standards,the allocation of garbage cans,the publicity and supervision of the policy,incentive measures and the implementation process and effect of the policy is very high,exceeding 80%or even more than 90%.Through binary logistic regression analysis,we come to the conclusion that six factors significantly affect residents’satisfaction with Beijing municipal solid waste classification policy,such as residents’monthly income,household daily average domestic waste production,publicity of waste classification policy,supervisors’better understanding of waste classification standards,guidance of waste delivery by community classification supervisors,and convenience of waste classification process.展开更多
Despite concerted efforts to create employment opportunities and the realized economic growth between 2000 and 2005, the unemployment rate in Namibia currently stands at 27.4%, according to the Labour Force Survey rel...Despite concerted efforts to create employment opportunities and the realized economic growth between 2000 and 2005, the unemployment rate in Namibia currently stands at 27.4%, according to the Labour Force Survey released in April 2013. The percentage of employed males in Namibia stands at 41.6% while that of employed females stand at 28.8% according to the National Human Resources Plan of May 2013. Analysts have put the blame on adverse climatic conditions, limited levels of skills, access to finance, and the structure of the economy. The frustration and discomfort caused by unemployment, especially among the youth, can threaten the country's peace and stability as it negatively impacts on the standard of living, crime rates, family happiness, and drug abuse.To date, studies on employment in Namibia have mainly concentrated on the micro and macro econometric approaches. It is important to examine how bio-demographic characteristics affect employment. This paper uses data from the 2010 Income and expenditure survey to establish the bio-demographic determinants of employment by fitting a binary logistic model. The outcome variable is employment status which is dichotomous. The independent variables which were guided by review of related literature and availability of data in the Income and Expenditure survey data set, included age-group, region, place of residence, marital status, education level, and gender. Results indicated that employment prospects in Namibia were influenced by the region, gender, marital status, and education level.展开更多
The risk factors of high trait anger of juvenile offenders were explored through questionnaire study in a youth correctional facility of Hubei province, China. A total of 1090 juvenile offenders in Hubei province were...The risk factors of high trait anger of juvenile offenders were explored through questionnaire study in a youth correctional facility of Hubei province, China. A total of 1090 juvenile offenders in Hubei province were investigated by self-compiled social-demographic questionnaire, Childhood Trauma Questionnaire(CTQ), and State-Trait Anger Expression Inventory-Ⅱ(STAXI-Ⅱ). The risk factors were analyzed by chi-square tests, correlation analysis, and binary logistic regression analysis with SPSS 19.0. A total of 1082 copies of valid questionnaires were collected. High trait anger group(n=316) was defined as those who scored in the upper 27 th percentile of STAXI-Ⅱ trait anger scale(TAS), and the rest were defined as low trait anger group(n=766). The risk factors associated with high level of trait anger included: childhood emotional abuse, childhood sexual abuse, step family, frequent drug abuse, and frequent internet using(P〈0.05 or P〈0.01). Birth sequence, number of sibling, ranking in the family, identity of the main care-taker, the education level of care-taker, educational style of care-taker, family income, relationship between parents, social atmosphere of local area, frequent drinking, and frequent smoking did not predict to high level of trait anger(P〉0.05). It was suggested that traumatic experience in childhood and unhealthy life style may significantly increase the level of trait anger in adulthood. The risk factors of high trait anger and their effects should be taken into consideration seriously.展开更多
目的:比较决策树和Logistic回归模型对体外受精-胚胎移植(in vitro fertilization and embryo transfer,IVF-ET)患者妊娠结局的预测价值。方法:纳入2021年1月至2022年10月在长治医学院附属和平医院接受IVF-ET的患者350例为研究对象,根...目的:比较决策树和Logistic回归模型对体外受精-胚胎移植(in vitro fertilization and embryo transfer,IVF-ET)患者妊娠结局的预测价值。方法:纳入2021年1月至2022年10月在长治医学院附属和平医院接受IVF-ET的患者350例为研究对象,根据妊娠结局分为妊娠成功组(215例)和妊娠失败组(135例)。收集患者临床资料,建立IVF-ET患者妊娠结局Logistic回归和决策树预测模型,并在是否基于Logistic回归结果条件下建立决策树分析模型(决策树1和决策树2),采用受试者工作特征(receiver operating characteristic,ROC)曲线对模型预测效果进行评价。结果:350例患者中,妊娠成功患者占61.43%,妊娠失败者占38.57%。妊娠失败组年龄≥35岁、不孕年限≥5年、周期次数≥1次、有心理精神障碍的患者比例及HCG日血清孕酮水平均高于妊娠成功组,获卵数≥10枚、受精率≥75%的患者比例及HCG日子宫内膜厚度、优质胚胎数小于妊娠成功组(P<0.05)。多因素Logistic回归分析结果显示,年龄、HCG日血清孕酮水平、优质胚胎数及心理精神障碍均是IVF-ET患者妊娠结局的影响因素(P<0.05)。决策树模型显示,年龄、HCG日血清孕酮水平、优质胚胎数为IVF-ET患者妊娠结局的影响因素。Logistic回归模型曲线下面积(area under curve,AUC)为0.832,预测敏感度、特异度和准确度分别为87.3%、71.4%、83.5%;决策树1的AUC为0.859,预测敏感度、特异度和准确度分别为85.1%、76.8%、85.6%;决策树2的AUC为0.820,预测敏感度、特异度和准确度分别为83.7%、73.2%、82.4%。决策树1的AUC大于决策树2(P<0.05),但与Logistic回归模型的AUC比较差异无统计学意义(P>0.05)。结论:Logistic回归模型和决策树模型对于IVF-ET患者妊娠结局均有一定的预测价值。展开更多
Internal solitary wave propagation over a submarine ridge results in energy dissipation, in which the hydrodynamic interaction between a wave and ridge affects marine environment. This study analyzes the effects of ri...Internal solitary wave propagation over a submarine ridge results in energy dissipation, in which the hydrodynamic interaction between a wave and ridge affects marine environment. This study analyzes the effects of ridge height and potential energy during wave-ridge interaction with a binary and cumulative logistic regression model. In testing the Global Null Hypothesis, all values are p 〈0.001, with three statistical methods, such as Likelihood Ratio, Score, and Wald. While comparing with two kinds of models, tests values obtained by cumulative logistic regression models are better than those by binary logistic regression models. Although this study employed cumulative logistic regression model, three probability functions p^1, p^2 and p^3, are utilized for investigating the weighted influence of factors on wave reflection. Deviance and Pearson tests are applied to cheek the goodness-of-fit of the proposed model. The analytical results demonstrated that both ridge height (X1 ) and potential energy (X2 ) significantly impact (p 〈 0. 0001 ) the amplitude-based refleeted rate; the P-values for the deviance and Pearson are all 〉 0.05 (0.2839, 0.3438, respectively). That is, the goodness-of-fit between ridge height ( X1 ) and potential energy (X2) can further predict parameters under the scenario of the best parsimonious model. Investigation of 6 predictive powers ( R2, Max-rescaled R^2, Sorners' D, Gamma, Tau-a, and c, respectively) indicate that these predictive estimates of the proposed model have better predictive ability than ridge height alone, and are very similar to the interaction of ridge height and potential energy. It can be concluded that the goodness-of-fit and prediction ability of the cumulative logistic regression model are better than that of the binary logistic regression model.展开更多
Landslide susceptibility maps(LSMs) play a vital role in assisting land use planning and risk mitigation. This study aims to optimize causative factors using logistic regression(LR) and an artificial neural network(AN...Landslide susceptibility maps(LSMs) play a vital role in assisting land use planning and risk mitigation. This study aims to optimize causative factors using logistic regression(LR) and an artificial neural network(ANN) to produce a LSM. The LSM is produced with 11 causative factors and then optimized using forward-stepwise LR(FSLR), ANN, and their combination(FSLR-ANN) until eight causative factors were found for each method. The ANN method produced superior validation results compared with LR. The ROC values for the training data set ranges between 0.8 and 0.9. On the other hand, validation with the percentage of landslide fall into LSM class high and very high, ANN method was higher(92.59%) than LR(82.12%). FSLR-ANN with nine causative factors gave the best validation results with respect to area under curve(AUC) values, and validation with the percentage of landslide fall into LSM class high and very high. In conclusion, ANN was found to be better than LR when producing LSMs. The best Optimization was combination of FSLR-ANN with nine causative factors and AUC success rate 0.847, predictive rate 0.844 and validation with landslide fall into high and very high class with 91.30%. It is an encouraging preliminary model towards a systematic introduction of FSLR-ANN model for optimization causative factors in landslide susceptibility assessment in the mountainous area of Ujung Loe Watershed.展开更多
文摘Landslide susceptibility mapping is the first step in regional hazard management as it helps to understand the spatial distribution of the probability of slope failure in an area.An attempt is made to map the landslide susceptibility in Tevankarai Ar subwatershed,Kodaikkanal,India using binary logistic regression analysis.Geographic Information System is used to prepare the database of the predictor variables and landslide inventory map,which is used to build the spatial model of landslide susceptibility.The model describes the relationship between the dependent variable(presence and absence of landslide) and the independent variables selected for study(predictor variables) by the best fitting function.A forward stepwise logistic regression model using maximum likelihood estimation is used in the regression analysis.An inventory of 84 landslides and cells within a buffer distance of 10m around the landslide is used as the dependent variable.Relief,slope,aspect,plan curvature,profile curvature,land use,soil,topographic wetness index,proximity to roads and proximity to lineaments are taken as independent variables.The constant and the coefficient of the predictor variable retained by the regression model are used to calculate the probability of slope failure and analyze the effect of each predictor variable on landslide occurrence in thestudy area.The model shows that the most significant parameter contributing to landslides is slope.The other significant parameters are profile curvature,soil,road,wetness index and relief.The predictive logistic regression model is validated using temporal validation data-set of known landslide locations and shows an accuracy of 85.29 %.
文摘Purpose:The purpose of this study is to develop and compare model choice strategies in context of logistic regression.Model choice means the choice of the covariates to be included in the model.Design/methodology/approach:The study is based on Monte Carlo simulations.The methods are compared in terms of three measures of accuracy:specificity and two kinds of sensitivity.A loss function combining sensitivity and specificity is introduced and used for a final comparison.Findings:The choice of method depends on how much the users emphasize sensitivity against specificity.It also depends on the sample size.For a typical logistic regression setting with a moderate sample size and a small to moderate effect size,either BIC,BICc or Lasso seems to be optimal.Research limitations:Numerical simulations cannot cover the whole range of data-generating processes occurring with real-world data.Thus,more simulations are needed.Practical implications:Researchers can refer to these results if they believe that their data-generating process is somewhat similar to some of the scenarios presented in this paper.Alternatively,they could run their own simulations and calculate the loss function.Originality/value:This is a systematic comparison of model choice algorithms and heuristics in context of logistic regression.The distinction between two types of sensitivity and a comparison based on a loss function are methodological novelties.
文摘In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.
基金Under the auspices of National Natural Science Foundation of China(No.42101414)Natural Science Found for Outstanding Young Scholars in Jilin Province(No.20230508106RC)。
文摘The burning of crop residues in fields is a significant global biomass burning activity which is a key element of the terrestrial carbon cycle,and an important source of atmospheric trace gasses and aerosols.Accurate estimation of cropland burned area is both crucial and challenging,especially for the small and fragmented burned scars in China.Here we developed an automated burned area mapping algorithm that was implemented using Sentinel-2 Multi Spectral Instrument(MSI)data and its effectiveness was tested taking Songnen Plain,Northeast China as a case using satellite image of 2020.We employed a logistic regression method for integrating multiple spectral data into a synthetic indicator,and compared the results with manually interpreted burned area reference maps and the Moderate-Resolution Imaging Spectroradiometer(MODIS)MCD64A1 burned area product.The overall accuracy of the single variable logistic regression was 77.38%to 86.90%and 73.47%to 97.14%for the 52TCQ and 51TYM cases,respectively.In comparison,the accuracy of the burned area map was improved to 87.14%and 98.33%for the 52TCQ and 51TYM cases,respectively by multiple variable logistic regression of Sentind-2 images.The balance of omission error and commission error was also improved.The integration of multiple spectral data combined with a logistic regression method proves to be effective for burned area detection,offering a highly automated process with an automatic threshold determination mechanism.This method exhibits excellent extensibility and flexibility taking the image tile as the operating unit.It is suitable for burned area detection at a regional scale and can also be implemented with other satellite data.
文摘In this paper, a logistical regression statistical analysis (LR) is presented for a set of variables used in experimental measurements in reversed field pinch (RFP) machines, commonly known as “slinky mode” (SM), observed to travel around the torus in Madison Symmetric Torus (MST). The LR analysis is used to utilize the modified Sine-Gordon dynamic equation model to predict with high confidence whether the slinky mode will lock or not lock when compared to the experimentally measured motion of the slinky mode. It is observed that under certain conditions, the slinky mode “locks” at or near the intersection of poloidal and/or toroidal gaps in MST. However, locked mode cease to travel around the torus;while unlocked mode keeps traveling without a change in the energy, making it hard to determine an exact set of conditions to predict locking/unlocking behaviour. The significant key model parameters determined by LR analysis are shown to improve the Sine-Gordon model’s ability to determine the locking/unlocking of magnetohydrodyamic (MHD) modes. The LR analysis of measured variables provides high confidence in anticipating locking versus unlocking of slinky mode proven by relational comparisons between simulations and the experimentally measured motion of the slinky mode in MST.
基金supported by the National College Students Innovation and Entrepreneurship Training Programs(CN)(Grant Nos.2021J00054&2019J00127)
文摘On the first anniversary of the implementation of the new regulations of Beijing Municipality on the management of domestic waste,to understand residents’views on the waste classification policy,the project conducted relevant investigation of the satisfaction of residents with the domestic waste classification policy in Daxing District of Beijing,China.Based on the analysis of the survey,this study uses the binary logistic regression model to explore the residents’satisfaction with the new domestic waste classification policy in Beijing and its influencing factors.The data from 398 valid questionnaires involve the demographic characteristics of residents,residents’cognition and views on Beijing municipal solid waste classification policy,and residents’satisfaction with Beijing domestic waste classification policy.The data show that the comprehensive satisfaction level of residents with the domestic waste classification policy in Beijing is quite high,up to 84.7%.Among them,the satisfaction level of residents with the details of the classification standards,the allocation of garbage cans,the publicity and supervision of the policy,incentive measures and the implementation process and effect of the policy is very high,exceeding 80%or even more than 90%.Through binary logistic regression analysis,we come to the conclusion that six factors significantly affect residents’satisfaction with Beijing municipal solid waste classification policy,such as residents’monthly income,household daily average domestic waste production,publicity of waste classification policy,supervisors’better understanding of waste classification standards,guidance of waste delivery by community classification supervisors,and convenience of waste classification process.
文摘Despite concerted efforts to create employment opportunities and the realized economic growth between 2000 and 2005, the unemployment rate in Namibia currently stands at 27.4%, according to the Labour Force Survey released in April 2013. The percentage of employed males in Namibia stands at 41.6% while that of employed females stand at 28.8% according to the National Human Resources Plan of May 2013. Analysts have put the blame on adverse climatic conditions, limited levels of skills, access to finance, and the structure of the economy. The frustration and discomfort caused by unemployment, especially among the youth, can threaten the country's peace and stability as it negatively impacts on the standard of living, crime rates, family happiness, and drug abuse.To date, studies on employment in Namibia have mainly concentrated on the micro and macro econometric approaches. It is important to examine how bio-demographic characteristics affect employment. This paper uses data from the 2010 Income and expenditure survey to establish the bio-demographic determinants of employment by fitting a binary logistic model. The outcome variable is employment status which is dichotomous. The independent variables which were guided by review of related literature and availability of data in the Income and Expenditure survey data set, included age-group, region, place of residence, marital status, education level, and gender. Results indicated that employment prospects in Namibia were influenced by the region, gender, marital status, and education level.
基金supported by National Natural Science Foundation of China(No.81373022)
文摘The risk factors of high trait anger of juvenile offenders were explored through questionnaire study in a youth correctional facility of Hubei province, China. A total of 1090 juvenile offenders in Hubei province were investigated by self-compiled social-demographic questionnaire, Childhood Trauma Questionnaire(CTQ), and State-Trait Anger Expression Inventory-Ⅱ(STAXI-Ⅱ). The risk factors were analyzed by chi-square tests, correlation analysis, and binary logistic regression analysis with SPSS 19.0. A total of 1082 copies of valid questionnaires were collected. High trait anger group(n=316) was defined as those who scored in the upper 27 th percentile of STAXI-Ⅱ trait anger scale(TAS), and the rest were defined as low trait anger group(n=766). The risk factors associated with high level of trait anger included: childhood emotional abuse, childhood sexual abuse, step family, frequent drug abuse, and frequent internet using(P〈0.05 or P〈0.01). Birth sequence, number of sibling, ranking in the family, identity of the main care-taker, the education level of care-taker, educational style of care-taker, family income, relationship between parents, social atmosphere of local area, frequent drinking, and frequent smoking did not predict to high level of trait anger(P〉0.05). It was suggested that traumatic experience in childhood and unhealthy life style may significantly increase the level of trait anger in adulthood. The risk factors of high trait anger and their effects should be taken into consideration seriously.
文摘目的:比较决策树和Logistic回归模型对体外受精-胚胎移植(in vitro fertilization and embryo transfer,IVF-ET)患者妊娠结局的预测价值。方法:纳入2021年1月至2022年10月在长治医学院附属和平医院接受IVF-ET的患者350例为研究对象,根据妊娠结局分为妊娠成功组(215例)和妊娠失败组(135例)。收集患者临床资料,建立IVF-ET患者妊娠结局Logistic回归和决策树预测模型,并在是否基于Logistic回归结果条件下建立决策树分析模型(决策树1和决策树2),采用受试者工作特征(receiver operating characteristic,ROC)曲线对模型预测效果进行评价。结果:350例患者中,妊娠成功患者占61.43%,妊娠失败者占38.57%。妊娠失败组年龄≥35岁、不孕年限≥5年、周期次数≥1次、有心理精神障碍的患者比例及HCG日血清孕酮水平均高于妊娠成功组,获卵数≥10枚、受精率≥75%的患者比例及HCG日子宫内膜厚度、优质胚胎数小于妊娠成功组(P<0.05)。多因素Logistic回归分析结果显示,年龄、HCG日血清孕酮水平、优质胚胎数及心理精神障碍均是IVF-ET患者妊娠结局的影响因素(P<0.05)。决策树模型显示,年龄、HCG日血清孕酮水平、优质胚胎数为IVF-ET患者妊娠结局的影响因素。Logistic回归模型曲线下面积(area under curve,AUC)为0.832,预测敏感度、特异度和准确度分别为87.3%、71.4%、83.5%;决策树1的AUC为0.859,预测敏感度、特异度和准确度分别为85.1%、76.8%、85.6%;决策树2的AUC为0.820,预测敏感度、特异度和准确度分别为83.7%、73.2%、82.4%。决策树1的AUC大于决策树2(P<0.05),但与Logistic回归模型的AUC比较差异无统计学意义(P>0.05)。结论:Logistic回归模型和决策树模型对于IVF-ET患者妊娠结局均有一定的预测价值。
基金This paper was financially supported by NSC96-2628-E-366-004-MY2 and NSC96-2628-E-132-001-MY2
文摘Internal solitary wave propagation over a submarine ridge results in energy dissipation, in which the hydrodynamic interaction between a wave and ridge affects marine environment. This study analyzes the effects of ridge height and potential energy during wave-ridge interaction with a binary and cumulative logistic regression model. In testing the Global Null Hypothesis, all values are p 〈0.001, with three statistical methods, such as Likelihood Ratio, Score, and Wald. While comparing with two kinds of models, tests values obtained by cumulative logistic regression models are better than those by binary logistic regression models. Although this study employed cumulative logistic regression model, three probability functions p^1, p^2 and p^3, are utilized for investigating the weighted influence of factors on wave reflection. Deviance and Pearson tests are applied to cheek the goodness-of-fit of the proposed model. The analytical results demonstrated that both ridge height (X1 ) and potential energy (X2 ) significantly impact (p 〈 0. 0001 ) the amplitude-based refleeted rate; the P-values for the deviance and Pearson are all 〉 0.05 (0.2839, 0.3438, respectively). That is, the goodness-of-fit between ridge height ( X1 ) and potential energy (X2) can further predict parameters under the scenario of the best parsimonious model. Investigation of 6 predictive powers ( R2, Max-rescaled R^2, Sorners' D, Gamma, Tau-a, and c, respectively) indicate that these predictive estimates of the proposed model have better predictive ability than ridge height alone, and are very similar to the interaction of ridge height and potential energy. It can be concluded that the goodness-of-fit and prediction ability of the cumulative logistic regression model are better than that of the binary logistic regression model.
文摘Landslide susceptibility maps(LSMs) play a vital role in assisting land use planning and risk mitigation. This study aims to optimize causative factors using logistic regression(LR) and an artificial neural network(ANN) to produce a LSM. The LSM is produced with 11 causative factors and then optimized using forward-stepwise LR(FSLR), ANN, and their combination(FSLR-ANN) until eight causative factors were found for each method. The ANN method produced superior validation results compared with LR. The ROC values for the training data set ranges between 0.8 and 0.9. On the other hand, validation with the percentage of landslide fall into LSM class high and very high, ANN method was higher(92.59%) than LR(82.12%). FSLR-ANN with nine causative factors gave the best validation results with respect to area under curve(AUC) values, and validation with the percentage of landslide fall into LSM class high and very high. In conclusion, ANN was found to be better than LR when producing LSMs. The best Optimization was combination of FSLR-ANN with nine causative factors and AUC success rate 0.847, predictive rate 0.844 and validation with landslide fall into high and very high class with 91.30%. It is an encouraging preliminary model towards a systematic introduction of FSLR-ANN model for optimization causative factors in landslide susceptibility assessment in the mountainous area of Ujung Loe Watershed.