The North China Plain and the agricultural region are crossed by the Shanxi-Beijing natural gas pipeline.Resi-dents in the area use rototillers for planting and harvesting;however,the depth of the rototillers into the...The North China Plain and the agricultural region are crossed by the Shanxi-Beijing natural gas pipeline.Resi-dents in the area use rototillers for planting and harvesting;however,the depth of the rototillers into the ground is greater than the depth of the pipeline,posing a significant threat to the safe operation of the pipeline.Therefore,it is of great significance to study the dynamic response of rotary tillers impacting pipelines to ensure the safe opera-tion of pipelines.This article focuses on the Shanxi-Beijing natural gas pipeline,utilizingfinite element simulation software to establish afinite element model for the interaction among the machinery,pipeline,and soil,and ana-lyzing the dynamic response of the pipeline.At the same time,a decision tree model is introduced to classify the damage of pipelines under different working conditions,and the boundary value and importance of each influen-cing factor on pipeline damage are derived.Considering the actual conditions in the hemp yam planting area,targeted management measures have been proposed to ensure the operational safety of the Shanxi-Beijing natural gas pipeline in this region.展开更多
Research on the quality of data in a structural calculation document(SCD)is lacking,although the SCD ofa bridge is used as an essential reference during the entire lifecycle of the facility.XML Schema matching enables...Research on the quality of data in a structural calculation document(SCD)is lacking,although the SCD ofa bridge is used as an essential reference during the entire lifecycle of the facility.XML Schema matching enables qualitative improvement of the stored data.This study aimed to enhance the applicability of XML Schema matching,which improves the speed and quality of information stored in bridge SCDs.First,the authors proposed a method of reducing the computing time for the schema matching of bridge SCDs.The computing speed of schema matching was increased by 13 to 1800 times by reducing the checking process of the correlations.Second,the authors developed a heuristic solution for selecting the optimal weight factors used in the matching process to maintain a high accuracy by introducing a decision tree.The decision tree model was built using the content elements stored in the SCD,design companies,bridge types,and weight factors as input variables,and the matching accuracy as the target variable.The inverse-calculation method was applied to extract the weight factors from the decision tree model for high-accuracy schema matching results.展开更多
Gully erosion is a disruptive phenomenon which extensively affects the Iranian territory,especially in the Northern provinces.A number of studies have been recently undertaken to study this process and to predict it o...Gully erosion is a disruptive phenomenon which extensively affects the Iranian territory,especially in the Northern provinces.A number of studies have been recently undertaken to study this process and to predict it over space and ultimately,in a broader national effort,to limit its negative effects on local communities.We focused on the Bastam watershed where 9.3%of its surface is currently affected by gullying.Machine learning algorithms are currently under the magnifying glass across the geomorphological community for their high predictive ability.However,unlike the bivariate statistical models,their structure does not provide intuitive and quantifiable measures of environmental preconditioning factors.To cope with such weakness,we interpret preconditioning causes on the basis of a bivariate approach namely,Index of Entropy.And,we performed the susceptibility mapping procedure by testing three extensions of a decision tree model namely,Alternating Decision Tree(ADTree),Naive-Bayes tree(NBTree),and Logistic Model Tree(LMT).We dichotomized the gully information over space into gully presence/absence conditions,which we further explored in their calibration and validation stages.Being the presence/absence information and associated factors identical,the resulting differences are only due to the algorithmic structures of the three models we chose.Such differences are not significant in terms of performances;in fact,the three models produce outstanding predictive AUC measures(ADTree=0.922;NBTree=0.939;LMT=0.944).However,the associated mapping results depict very different patterns where only the LMT is associated with reasonable susceptibility patterns.This is a strong indication of what model combines best performance and mapping for any natural hazard-oriented application.展开更多
Students in South African Universities come from different socio-cultural backgrounds, countries and high schools. This suggests that these students have different experiences which impact on their levels of grasping ...Students in South African Universities come from different socio-cultural backgrounds, countries and high schools. This suggests that these students have different experiences which impact on their levels of grasping information in class as they potentially use different lenses on tuition. The current practice in Universities in contributing to the academic performance of students includes the use of tutors, the use of mobile devices for first year students, use of student assistants and the use of different feedback measures. What is problematic about the current practice is that students are quitting university in high numbers. In this study, knowledge has been drawn from data through the use of machine learning algorithms. Bayesian networks, support vector machines (SVMs) and decision trees algorithms were used individually in this work to construct predictive models for the academic performance of students. The best model was constructed using SVM and it gave a prediction of 72.87% and a prediction cost of 139. The model does predict the performance of students in advance of the year-end examinations outcome. The results suggest that South African Universities must recognize the diversity in student population and thus provide students with better support and equip them with the necessary knowledge that will enable them to tap into their full potential and thus enhance their skills.展开更多
BACKGROUND With the aging world population,the incidence of falls has intensified and fallrelated hospitalization costs are increasing.Falls are one type of event studied in the health economics of patient safety,and ...BACKGROUND With the aging world population,the incidence of falls has intensified and fallrelated hospitalization costs are increasing.Falls are one type of event studied in the health economics of patient safety,and many developed countries have conducted such research on fall-related hospitalization costs.However,China,a developing country,still lacks large-scale studies in this area.AIM To investigate the factors related to the hospitalization costs of fall-related injuries in elderly inpatients and establish factor-based,cost-related groupings.METHODS A retrospective study was conducted.Patient information and cost data for elderly inpatients(age≥60 years,n=3362)who were hospitalized between 2016 and 2019 due to falls was collected from the medical record systems of two grade-A tertiary hospitals in China.Quantile regression(QR)analysis was used to identify the factors related to fall-related hospitalization costs.A decision tree model based on the chi-squared automatic interaction detector algorithm for hospitalization cost grouping was built by setting the factors in the regression results as separation nodes.RESULTS The total hospitalization cost of fall-related injuries in the included elderly patients was 180479203.03 RMB,and the reimbursement rate of medical benefit funds was 51.0%(92039709.52 RMB/180479203.03 RMB).The medical material costs were the highest component of the total hospitalization cost,followed(in order)by drug costs,test costs,treatment costs,integrated medical service costs and blood transfusion costs The QR results showed that patient age,gender,length of hospital stay,payment method,wound position,wound type,operation times and operation type significantly influenced the inpatient cost(P<0.05).The cost grouping model was established based on the QR results,and age,length of stay,operation type,wound position and wound type were the most important influencing factors in the model.Furthermore,the cost of each combination varied significantly.CONCLUSION Our grouping model of hospitalization costs clearly reflected the key factors affecting hospitalization costs and can be used to strengthen the reasonable control of these costs.展开更多
The allocation of resources in English teaching can improve the ability of resource sharing, in order to optimize the allocation of resources, so as to improve the performance of English teaching, and promote the cons...The allocation of resources in English teaching can improve the ability of resource sharing, in order to optimize the allocation of resources, so as to improve the performance of English teaching, and promote the construction of English teaching resources database, a method of optimizing the allocation of English teaching resources is proposed based on network cloud platform. Text semantic key words conceptual decision tree model is constructed for massive English teaching resources allocation, semantic information conversion method is used to compute key semantic features of English Teaching resources, the concept convergence point of English Teaching resource allocation is formed in semantic model. According to the set between the upper and lower relationship, a decision tree model of English Teaching semantic subject words is constructed, semantic conversion and information extraction are realized. English teaching resources optimization allocation simulation is taken in the cloud platform, simulation results show that the scheduling performance of English teaching resources is better, and the adaptive allocation ability of English teaching resources is stronger, and the resource utilization rate is higher.展开更多
Patients with lung cancer at the same stage may have markedly different overall outcome and a lack of specific biomarker to predict lung cancer outcome.Heat-shock protein 90β(HSP90β)is overexpressed in various tumor...Patients with lung cancer at the same stage may have markedly different overall outcome and a lack of specific biomarker to predict lung cancer outcome.Heat-shock protein 90β(HSP90β)is overexpressed in various tumor cells.In this study,the ELISA results of HSP90βcombined with CEA,CA125,and CYFRA21-1 were used to construct a recursive partitioning decision tree model to establish a four-protein diagnostic model and predict the survival of patients with lung cancer.Survival analysis showed that the recursive partitioning decision tree could distinguish the prognosis between high-and low-risk groups.Results suggested that the joint detection of HSP90β,CEA,CA125,and CYFRA21-1 in the peripheral blood of patients with lung cancer is plausible for early diagnosis and prognosis prediction of lung cancer.展开更多
The probability of default(PD) is the key element in the New Basel Capital Accord and the most essential factor to financial institutions' risk management.To obtain good PD estimation,practitioners and academics h...The probability of default(PD) is the key element in the New Basel Capital Accord and the most essential factor to financial institutions' risk management.To obtain good PD estimation,practitioners and academics have put forward numerous default prediction models.However,how to use multiple models to enhance overall performance on default prediction remains untouched.In this paper,a parametric and non-parametric combination model is proposed.Firstly,binary logistic regression model(BLRM),support vector machine(SVM),and decision tree(DT) are used respectively to establish models with relatively stable and high performance.Secondly,in order to make further improvement to the overall performance,a combination model using the method of multiple discriminant analysis(MDA) is constructed.In this way,the coverage rate of the combination model is greatly improved,and the risk of miscarriage is effectively reduced.Lastly,the results of the combination model are analyzed by using the K-means clustering,and the clustering distribution is consistent with a normal distribution.The results show that the combination model based on parametric and non-parametric can effectively enhance the overall performance on default prediction.展开更多
文摘The North China Plain and the agricultural region are crossed by the Shanxi-Beijing natural gas pipeline.Resi-dents in the area use rototillers for planting and harvesting;however,the depth of the rototillers into the ground is greater than the depth of the pipeline,posing a significant threat to the safe operation of the pipeline.Therefore,it is of great significance to study the dynamic response of rotary tillers impacting pipelines to ensure the safe opera-tion of pipelines.This article focuses on the Shanxi-Beijing natural gas pipeline,utilizingfinite element simulation software to establish afinite element model for the interaction among the machinery,pipeline,and soil,and ana-lyzing the dynamic response of the pipeline.At the same time,a decision tree model is introduced to classify the damage of pipelines under different working conditions,and the boundary value and importance of each influen-cing factor on pipeline damage are derived.Considering the actual conditions in the hemp yam planting area,targeted management measures have been proposed to ensure the operational safety of the Shanxi-Beijing natural gas pipeline in this region.
基金This research was supported by Basic Science Research Program through the National Research Foundation of Korea(NRF)funded by the Ministry of Education(2016R1A6A3A11934917).
文摘Research on the quality of data in a structural calculation document(SCD)is lacking,although the SCD ofa bridge is used as an essential reference during the entire lifecycle of the facility.XML Schema matching enables qualitative improvement of the stored data.This study aimed to enhance the applicability of XML Schema matching,which improves the speed and quality of information stored in bridge SCDs.First,the authors proposed a method of reducing the computing time for the schema matching of bridge SCDs.The computing speed of schema matching was increased by 13 to 1800 times by reducing the checking process of the correlations.Second,the authors developed a heuristic solution for selecting the optimal weight factors used in the matching process to maintain a high accuracy by introducing a decision tree.The decision tree model was built using the content elements stored in the SCD,design companies,bridge types,and weight factors as input variables,and the matching accuracy as the target variable.The inverse-calculation method was applied to extract the weight factors from the decision tree model for high-accuracy schema matching results.
文摘Gully erosion is a disruptive phenomenon which extensively affects the Iranian territory,especially in the Northern provinces.A number of studies have been recently undertaken to study this process and to predict it over space and ultimately,in a broader national effort,to limit its negative effects on local communities.We focused on the Bastam watershed where 9.3%of its surface is currently affected by gullying.Machine learning algorithms are currently under the magnifying glass across the geomorphological community for their high predictive ability.However,unlike the bivariate statistical models,their structure does not provide intuitive and quantifiable measures of environmental preconditioning factors.To cope with such weakness,we interpret preconditioning causes on the basis of a bivariate approach namely,Index of Entropy.And,we performed the susceptibility mapping procedure by testing three extensions of a decision tree model namely,Alternating Decision Tree(ADTree),Naive-Bayes tree(NBTree),and Logistic Model Tree(LMT).We dichotomized the gully information over space into gully presence/absence conditions,which we further explored in their calibration and validation stages.Being the presence/absence information and associated factors identical,the resulting differences are only due to the algorithmic structures of the three models we chose.Such differences are not significant in terms of performances;in fact,the three models produce outstanding predictive AUC measures(ADTree=0.922;NBTree=0.939;LMT=0.944).However,the associated mapping results depict very different patterns where only the LMT is associated with reasonable susceptibility patterns.This is a strong indication of what model combines best performance and mapping for any natural hazard-oriented application.
文摘Students in South African Universities come from different socio-cultural backgrounds, countries and high schools. This suggests that these students have different experiences which impact on their levels of grasping information in class as they potentially use different lenses on tuition. The current practice in Universities in contributing to the academic performance of students includes the use of tutors, the use of mobile devices for first year students, use of student assistants and the use of different feedback measures. What is problematic about the current practice is that students are quitting university in high numbers. In this study, knowledge has been drawn from data through the use of machine learning algorithms. Bayesian networks, support vector machines (SVMs) and decision trees algorithms were used individually in this work to construct predictive models for the academic performance of students. The best model was constructed using SVM and it gave a prediction of 72.87% and a prediction cost of 139. The model does predict the performance of students in advance of the year-end examinations outcome. The results suggest that South African Universities must recognize the diversity in student population and thus provide students with better support and equip them with the necessary knowledge that will enable them to tap into their full potential and thus enhance their skills.
基金Supported by The National Key Research and Development Project,No.2020YFC2005900.
文摘BACKGROUND With the aging world population,the incidence of falls has intensified and fallrelated hospitalization costs are increasing.Falls are one type of event studied in the health economics of patient safety,and many developed countries have conducted such research on fall-related hospitalization costs.However,China,a developing country,still lacks large-scale studies in this area.AIM To investigate the factors related to the hospitalization costs of fall-related injuries in elderly inpatients and establish factor-based,cost-related groupings.METHODS A retrospective study was conducted.Patient information and cost data for elderly inpatients(age≥60 years,n=3362)who were hospitalized between 2016 and 2019 due to falls was collected from the medical record systems of two grade-A tertiary hospitals in China.Quantile regression(QR)analysis was used to identify the factors related to fall-related hospitalization costs.A decision tree model based on the chi-squared automatic interaction detector algorithm for hospitalization cost grouping was built by setting the factors in the regression results as separation nodes.RESULTS The total hospitalization cost of fall-related injuries in the included elderly patients was 180479203.03 RMB,and the reimbursement rate of medical benefit funds was 51.0%(92039709.52 RMB/180479203.03 RMB).The medical material costs were the highest component of the total hospitalization cost,followed(in order)by drug costs,test costs,treatment costs,integrated medical service costs and blood transfusion costs The QR results showed that patient age,gender,length of hospital stay,payment method,wound position,wound type,operation times and operation type significantly influenced the inpatient cost(P<0.05).The cost grouping model was established based on the QR results,and age,length of stay,operation type,wound position and wound type were the most important influencing factors in the model.Furthermore,the cost of each combination varied significantly.CONCLUSION Our grouping model of hospitalization costs clearly reflected the key factors affecting hospitalization costs and can be used to strengthen the reasonable control of these costs.
文摘The allocation of resources in English teaching can improve the ability of resource sharing, in order to optimize the allocation of resources, so as to improve the performance of English teaching, and promote the construction of English teaching resources database, a method of optimizing the allocation of English teaching resources is proposed based on network cloud platform. Text semantic key words conceptual decision tree model is constructed for massive English teaching resources allocation, semantic information conversion method is used to compute key semantic features of English Teaching resources, the concept convergence point of English Teaching resource allocation is formed in semantic model. According to the set between the upper and lower relationship, a decision tree model of English Teaching semantic subject words is constructed, semantic conversion and information extraction are realized. English teaching resources optimization allocation simulation is taken in the cloud platform, simulation results show that the scheduling performance of English teaching resources is better, and the adaptive allocation ability of English teaching resources is stronger, and the resource utilization rate is higher.
基金was granted by CAMS Innovation Fund for Medical Sciences(CIFMS)(No.2016-I2M-1-001).
文摘Patients with lung cancer at the same stage may have markedly different overall outcome and a lack of specific biomarker to predict lung cancer outcome.Heat-shock protein 90β(HSP90β)is overexpressed in various tumor cells.In this study,the ELISA results of HSP90βcombined with CEA,CA125,and CYFRA21-1 were used to construct a recursive partitioning decision tree model to establish a four-protein diagnostic model and predict the survival of patients with lung cancer.Survival analysis showed that the recursive partitioning decision tree could distinguish the prognosis between high-and low-risk groups.Results suggested that the joint detection of HSP90β,CEA,CA125,and CYFRA21-1 in the peripheral blood of patients with lung cancer is plausible for early diagnosis and prognosis prediction of lung cancer.
基金supported by the National Natural Science Foundation of China Key Project under Grant No.70933003the National Natural Science Foundation of China under Grant Nos.70871109 and 71203247
文摘The probability of default(PD) is the key element in the New Basel Capital Accord and the most essential factor to financial institutions' risk management.To obtain good PD estimation,practitioners and academics have put forward numerous default prediction models.However,how to use multiple models to enhance overall performance on default prediction remains untouched.In this paper,a parametric and non-parametric combination model is proposed.Firstly,binary logistic regression model(BLRM),support vector machine(SVM),and decision tree(DT) are used respectively to establish models with relatively stable and high performance.Secondly,in order to make further improvement to the overall performance,a combination model using the method of multiple discriminant analysis(MDA) is constructed.In this way,the coverage rate of the combination model is greatly improved,and the risk of miscarriage is effectively reduced.Lastly,the results of the combination model are analyzed by using the K-means clustering,and the clustering distribution is consistent with a normal distribution.The results show that the combination model based on parametric and non-parametric can effectively enhance the overall performance on default prediction.