Malware is an ever-present and dynamic threat to networks and computer systems in cybersecurity,and because of its complexity and evasiveness,it is challenging to identify using traditional signature-based detection a...Malware is an ever-present and dynamic threat to networks and computer systems in cybersecurity,and because of its complexity and evasiveness,it is challenging to identify using traditional signature-based detection approaches.The study article discusses the growing danger to cybersecurity that malware hidden in PDF files poses,highlighting the shortcomings of conventional detection techniques and the difficulties presented by adversarial methodologies.The article presents a new method that improves PDF virus detection by using document analysis and a Logistic Model Tree.Using a dataset from the Canadian Institute for Cybersecurity,a comparative analysis is carried out with well-known machine learning models,such as Credal Decision Tree,Naïve Bayes,Average One Dependency Estimator,Locally Weighted Learning,and Stochastic Gradient Descent.Beyond traditional structural and JavaScript-centric PDF analysis,the research makes a substantial contribution to the area by boosting precision and resilience in malware detection.The use of Logistic Model Tree,a thorough feature selection approach,and increased focus on PDF file attributes all contribute to the efficiency of PDF virus detection.The paper emphasizes Logistic Model Tree’s critical role in tackling increasing cybersecurity threats and proposes a viable answer to practical issues in the sector.The results reveal that the Logistic Model Tree is superior,with improved accuracy of 97.46%when compared to benchmark models,demonstrating its usefulness in addressing the ever-changing threat landscape.展开更多
The North China Plain and the agricultural region are crossed by the Shanxi-Beijing natural gas pipeline.Resi-dents in the area use rototillers for planting and harvesting;however,the depth of the rototillers into the...The North China Plain and the agricultural region are crossed by the Shanxi-Beijing natural gas pipeline.Resi-dents in the area use rototillers for planting and harvesting;however,the depth of the rototillers into the ground is greater than the depth of the pipeline,posing a significant threat to the safe operation of the pipeline.Therefore,it is of great significance to study the dynamic response of rotary tillers impacting pipelines to ensure the safe opera-tion of pipelines.This article focuses on the Shanxi-Beijing natural gas pipeline,utilizingfinite element simulation software to establish afinite element model for the interaction among the machinery,pipeline,and soil,and ana-lyzing the dynamic response of the pipeline.At the same time,a decision tree model is introduced to classify the damage of pipelines under different working conditions,and the boundary value and importance of each influen-cing factor on pipeline damage are derived.Considering the actual conditions in the hemp yam planting area,targeted management measures have been proposed to ensure the operational safety of the Shanxi-Beijing natural gas pipeline in this region.展开更多
BACKGROUND Development of distant metastasis(DM)is a major concern during treatment of nasopharyngeal carcinoma(NPC).However,studies have demonstrated im-proved distant control and survival in patients with advanced N...BACKGROUND Development of distant metastasis(DM)is a major concern during treatment of nasopharyngeal carcinoma(NPC).However,studies have demonstrated im-proved distant control and survival in patients with advanced NPC with the addition of chemotherapy to concomitant chemoradiotherapy.Therefore,precise prediction of metastasis in patients with NPC is crucial.AIM To develop a predictive model for metastasis in NPC using detailed magnetic resonance imaging(MRI)reports.METHODS This retrospective study included 792 patients with non-distant metastatic NPC.A total of 469 imaging variables were obtained from detailed MRI reports.Data were stratified and randomly split into training(50%)and testing sets.Gradient boosting tree(GBT)models were built and used to select variables for predicting DM.A full model comprising all variables and a reduced model with the top-five variables were built.Model performance was assessed by area under the curve(AUC).RESULTS Among the 792 patients,94 developed DM during follow-up.The number of metastatic cervical nodes(30.9%),tumor invasion in the posterior half of the nasal cavity(9.7%),two sides of the pharyngeal recess(6.2%),tubal torus(3.3%),and single side of the parapharyngeal space(2.7%)were the top-five contributors for predicting DM,based on their relative importance in GBT models.The testing AUC of the full model was 0.75(95%confidence interval[CI]:0.69-0.82).The testing AUC of the reduced model was 0.75(95%CI:0.68-0.82).For the whole dataset,the full(AUC=0.76,95%CI:0.72-0.82)and reduced models(AUC=0.76,95%CI:0.71-0.81)outperformed the tumor node-staging system(AUC=0.67,95%CI:0.61-0.73).CONCLUSION The GBT model outperformed the tumor node-staging system in predicting metastasis in NPC.The number of metastatic cervical nodes was identified as the principal contributing variable.展开更多
With the increasing availability of precipitation radar data from space,enhancement of the resolution of spaceborne precipitation observations is important,particularly for hazard prediction and climate modeling at lo...With the increasing availability of precipitation radar data from space,enhancement of the resolution of spaceborne precipitation observations is important,particularly for hazard prediction and climate modeling at local scales relevant to extreme precipitation intensities and gradients.In this paper,the statistical characteristics of radar precipitation reflectivity data are studied and modeled using a hidden Markov tree(HMT)in the wavelet domain.Then,a high-resolution interpolation algorithm is proposed for spaceborne radar reflectivity using the HMT model as prior information.Owing to the small and transient storm elements embedded in the larger and slowly varying elements,the radar precipitation data exhibit distinct multiscale statistical properties,including a non-Gaussian structure and scale-to-scale dependency.An HMT model can capture well the statistical properties of radar precipitation,where the wavelet coefficients in each sub-band are characterized as a Gaussian mixture model(GMM),and the wavelet coefficients from the coarse scale to fine scale are described using a multiscale Markov process.The state probabilities of the GMM are determined using the expectation maximization method,and other parameters,for instance,the variance decay parameters in the HMT model are learned and estimated from high-resolution ground radar reflectivity images.Using the prior model,the wavelet coefficients at finer scales are estimated using local Wiener filtering.The interpolation algorithm is validated using data from the precipitation radar onboard the Tropical Rainfall Measurement Mission satellite,and the reconstructed results are found to be able to enhance the spatial resolution while optimally reproducing the local extremes and gradients.展开更多
Tree mortality plays a fundamental role in the dynamics of forest ecosystems,yet it is one of the most difficult phenomena to accurately predict.Various modeling strategies have been developed to improve individual tr...Tree mortality plays a fundamental role in the dynamics of forest ecosystems,yet it is one of the most difficult phenomena to accurately predict.Various modeling strategies have been developed to improve individual tree mortality predictions.One less explored strategy is the use of a multistage modeling approach.Potential improvements from this approach have remained largely unknown.In this study,we developed a novel multistage approach and compared its performance in individual tree mortality predictions with a more conventional approach using an identical individual tree mortality model formulation.Extensive permanent plot data(n=9442)covering the Acadian Region of North America and over multiple decades(1965–2014)were used in this study.Our results indicated that the model behavior with the multistage approach better depicted the observed mortality and showed a notable improvement over the conventional approach.The difference between the observed and predicted numbers of dead trees using the multistage approach was much smaller when compared with the conventional approach.In addition,tree survival probabilities predicted by the multistage approach generally were not significantly different from the observations,whereas the conventional approach consistently underestimated mortality across species and overestimated tree survival probabilities over the large range of DBH in the data.The new multistage approach also predictions of zero mortality in individual plots,a result not possible in conventional models.Finally,the new approach was more tolerant of modeling errors because it based estimates on ranked tree mortality rather than error-prone predicted values.Overall,this new multistage approach deserves to be considered and tested in future studies.展开更多
Digital aerial photograph(DAP)data is processed based on Structure from Motion(Sf M)algorithm and regional net adjustment method to generate digital surface discrete point clouds similar to Light Detection and Ranging...Digital aerial photograph(DAP)data is processed based on Structure from Motion(Sf M)algorithm and regional net adjustment method to generate digital surface discrete point clouds similar to Light Detection and Ranging(LiDAR)and digital orthophoto mosaic(DOM)similar to optical remote sensing image.In this study,we obtained highresolution images of mature forests of Chinese fir by unmanned aerial vehicle(UAV)flying through crossroute flight,and then reconstructed the threedimensional point clouds in the UAV aerial area by SfM technique.The point cloud segmentation(PCS)algorithm was used for the individual tree segmentation,and the F-score of the three sample plots were 0.91,0.94,and 0.94,respectively.Individual tree biomass modeling was conducted using 155 mature Chinese fir forests which were correctly segmented.The relative root mean squared error(rRMSE)values of random forest(RF),bagged tree(BT)and support vector regression(SVR)were 34.48%,35.74%and 40.93%,respectively.Our study demonstrated that DAP point clouds had great potential to extract forest vertical parameters and could be applied successfully in individual tree segmentation and individual tree biomass modeling.展开更多
Objective: The progression of human cancer is characterized by the accumulation of genetic instability. An increasing number of experimental genetic molecular techniques have been used to detect chromosome aberration...Objective: The progression of human cancer is characterized by the accumulation of genetic instability. An increasing number of experimental genetic molecular techniques have been used to detect chromosome aberrations. Previous studies on chromosome abnormalities often focused on identifying the frequent loci of chromosome alterations, but rarely addressed the issue of interrelationship of chromosomal abnormalities. In the last few years, several mathematical models have been employed to construct models of carcinogenesis, in an attempt to identify the time order and cause-and-effect relationship of chromosome aberrations. The principles and applications of these models are reviewed and compared in this paper. Mathematical modeling of carcinogenesis can contribute to our understanding of the molecular genetics of tumor development, and identification of cancer related genes, thus leading to improved clinical practice of cancer.展开更多
Tyre pressure monitoring system(TPMS)is compulsory in most countries like the United States and European Union.The existing systems depend on pressure sensors strapped on the tyre or on wheel speed sensor data.A diffe...Tyre pressure monitoring system(TPMS)is compulsory in most countries like the United States and European Union.The existing systems depend on pressure sensors strapped on the tyre or on wheel speed sensor data.A difference in wheel speed would trigger an alarm based on the algorithm implemented.In this paper,machine learning approach is proposed as a new method to monitor tyre pressure by extracting the vertical vibrations from a wheel hub of a moving vehicle using an accelerometer.The obtained signals will be used to compute through statistical features and histogram features for the feature extraction process.The LMT(Logistic Model Tree)was used as the classifier and attained a classification accuracy of 92.5%with 10-fold cross validation for statistical features and 90.5% with 10-fold cross validation for histogram features.The proposed model can be used for monitoring the automobile tyre pressure successfully.展开更多
This study was conducted to evaluate the performance of six stem taper models on four tropical tree species, namely Celtis luzonica(Magabuyo),Diplodiscus paniculatus(Balobo), Parashorea malaanonan(Bagtikan), and Swiet...This study was conducted to evaluate the performance of six stem taper models on four tropical tree species, namely Celtis luzonica(Magabuyo),Diplodiscus paniculatus(Balobo), Parashorea malaanonan(Bagtikan), and Swietenia macrophylla(Mahogany) in Mount Makiling Forest Reserve(MMFR), Philippines using fit statistics and lack-of-fit statistics. Four statistical criteria were used in this study, including the standard error of estimate(SEE),coefficient of determination(R^2), mean bias( E),and absolute mean difference(AMD). For the lack-offit statistics, SEE, E and AMD were determined in different relative height classes. The results indicated that the Kozak02 stem taper model offered the best fit for the four tropical species in most statistics. The Kozak02 model also consistently provided the best performance in the lack-of-fit statistics with the best SEE, E and AMD in most of the relative height classes. These stem taper equations could help forest managers and researchers better estimate the diameter of the outside bark with any given height,merchantable stem volumes and total stem volumes of standing trees belonging to the four species of thetropical forest in MMFR.展开更多
AIM: To construct tree models for classification of diffuse large B-cell lymphomas (DLBCL) by chromosome copy numbers, to compare them with cDNA microarray classification, and to explore models of multi-gene, multi-st...AIM: To construct tree models for classification of diffuse large B-cell lymphomas (DLBCL) by chromosome copy numbers, to compare them with cDNA microarray classification, and to explore models of multi-gene, multi-step and multi-pathway processes of DLBCL tumorigenesis. METHODS: Maximum-weight branching and distancebased models were constructed based on the comparative genomic hybridization (CGH) data of 123 DLBCL samples using the established methods and software of Desper et al . A maximum likelihood tree model was also used to analyze the data. By comparing with the results reported in literature, values of tree models in the classification of DLBCL were elucidated. RESULTS: Both the branching and the distance-based trees classified DLBCL into three groups. We combined the classification methods of the two models and classified DLBCL into three categories according to their characteristics. The first group was marked by +Xq, +Xp, -17p and +13q; the second group by +3q, +18q and +18p; and the third group was marked by -6q and +6p. This chromosomal classification was consistent with cDNA classification. It indicated that -6q and +3q were two main events in the tumorigenesis of lymphoma. CONCLUSION: Tree models of lymphoma established from CGH data can be used in the classification of DLBCL. These models can suggest multi-gene, multistep and multi-pathway processes of tumorigenesis. Two pathways, -6q preceding +6q and +3q preceding+18q, may be important in understanding tumorigenesis of DLBCL. The pathway, -6q preceding +6q, may have a close relationship with the tumorigenesis of non-GCB DLBCL.展开更多
The artificial pure and mixed Korean pine (Pinus koraiensis) forests were investigated at Dailing Forestry Bureau in Xiaoxing'an mountains from 1990 to 1992. Depending on the distance between the samplings of Kore...The artificial pure and mixed Korean pine (Pinus koraiensis) forests were investigated at Dailing Forestry Bureau in Xiaoxing'an mountains from 1990 to 1992. Depending on the distance between the samplings of Korean pine and their neighbor trees, the neighbor tree height, the size of neighbor tree canopy, and dimension of neighbor tree. The forest structure was classified into three types: (1) prowth of a tree in the light (open), (2) Growth of a tree in the canopy gap (Gap), (3)Growth of a tree under broad-leaved tree canopy. The frequeney, height, and age of stem divergence of Korean pine tree were investigated by sampling trees. The temporal and spatial model of the tree growth was applied on basis of the height of stem divergence, ratio of height and DBH, and character of tree stem.The morphology and growth character of Korean pine trees during different development stage were forecasted.展开更多
Background: A novel approach to modelling individual tree growth dynamics is proposed. The approach combines multiple imputation and copula sampling to produce a stochastic individual tree growth and yield projection...Background: A novel approach to modelling individual tree growth dynamics is proposed. The approach combines multiple imputation and copula sampling to produce a stochastic individual tree growth and yield projection system. Methods: The Nova Scotia, Canada permanent sample plot network is used as a case study to develop and test the modelling approach. Predictions from this model are compared to predictions from the Acadian variant of the Forest Vegetation Simulator, a widely used statistical individual tree growth and yield model. Results: Diameter and height growth rates were predicted with error rates consistent with those produced using statistical models. Mortality and ingrowth error rates were higher than those observed for diameter and height, but also were within the bounds produced by traditional approaches for predicting these rates. Ingrowth species composition was very poorly predicted. The model was capable of reproducing a wide range of stand dynamic trajectories and in some cases reproduced trajectories that the statistical model was incapable of reproducing. Conclusions: The model has potential to be used as a benchmarking tool for evaluating statistical and process models and may provide a mechanism to separate signal from noise and improve our ability to analyze and learn from large regional datasets that often have underlying flaws in sample design.展开更多
In order to effectively avoid the defects of a traditional discounted cash flow method, a trinomial tree pricing model of the real option is improved and used to forecast the investment price of mining. Taking Molybde...In order to effectively avoid the defects of a traditional discounted cash flow method, a trinomial tree pricing model of the real option is improved and used to forecast the investment price of mining. Taking Molybdenum ore as an example, a theoretical model for the hurdle price under the optimal investment timing is constructed. Based on the example data, the op- tion price model is simulated. By the model, mine investment price can be computed and forecast effectively. According to the characteristics of mine investment, cut-off grade, reserve estimation and mine life in different price also can be quantified. The result shows that it is reliable and practical to enhance the accuracy for mining investment decision.展开更多
Background: Growth and yield models are important tools for forest planning. Due to its geographic location, topology, and history of management, the forests of the Adirondacks Region of New York are unique and compl...Background: Growth and yield models are important tools for forest planning. Due to its geographic location, topology, and history of management, the forests of the Adirondacks Region of New York are unique and complex. However, only a relatively limited number of growth and yield models have been developed and/or can be reasonably extended to this region currently. Methods: in this analysis, 571 long-term continuous forest inventory plots with a total of 10 - 52 years of measurement data from four experimental forests maintained by the State University of New York College of Environmental Science and Forestry and one nonindustrial private forest were used to develop an individual tree growth model for the primary hardwood and softwood species in the region. Species-specific annualized static and dynamic equations were developed using the available data and the system was evaluated for long-term behavior. Results: Equivalence tests indicated that the Northeast Variant of the Forest Vegetation Simulator (FVS-NE) was biased in its estimation of tree total and bole height, diameter and height increment, and mortality for most species examined. In contrast, the developed static and annualized dynamic, species-specific equations performed quite well given the underlying variability in the data. Long-term model projections were consistent with the data and suggest a relatively robust system for prediction. Conclusions: Overall, the developed growth model showed reasonable behavior and is a significant improvement over existing models for the region. The model also highlighted the complexities of forest dynamics in the region and should help improve forest planning efforts there.展开更多
In the present article it will be critically questioned the traditional entrepreneurship education approaches based on a narrow conception of competency, and their values. Assuming the perspective that to be an entrep...In the present article it will be critically questioned the traditional entrepreneurship education approaches based on a narrow conception of competency, and their values. Assuming the perspective that to be an entrepreneur is basically an attitude towards life and the world, there proposed holistic, constructivist and experiential processes and strategies for entrepreneurship education. The "entrepreneur XXI", must be able to undertake a social function of change, so, an economical and social development more human, ethical and intelligent. Under this assumption, the "Tree Model for the Development of Entrepreneurial Competencies", that will be discussed globally in the second part of this article, suggests a dynamic and experiential approach ofentrepreneurship education based on the qualification of people's behaviour, self-esteem, competencies and experiences; a profile of key behavioural and performance competencies (root), experimental pedagogical procedures (trunk) and real results within group projects (fruits). This model has been developed during the last decade (2001-2011), using a multidisciplinary research-action procedure, within business, education (at different teaching levels) and social project environments.展开更多
A model of using binomial tree pricing formulae in a fuzzy market is proposed. In the fuzzy market, a price interval can be got according to the belief degree. The rule for the reasonability of the price interval is p...A model of using binomial tree pricing formulae in a fuzzy market is proposed. In the fuzzy market, a price interval can be got according to the belief degree. The rule for the reasonability of the price interval is proposed. The explicit expression of the interval is discussed in some special settings.展开更多
Scour is a natural phenomenon that is created by the rivers streams or the flood which brings about transferring or eroding of bed materials. To have accurate and safe erosion control structures design, maximum scour ...Scour is a natural phenomenon that is created by the rivers streams or the flood which brings about transferring or eroding of bed materials. To have accurate and safe erosion control structures design, maximum scour depth in downstream of the structures gains specific significance. In the current study, M5 model tree as remedy data mining approaches is suggested to estimate the scour depth around the abutments. To do this, Kayaturk laboratory data (2005), with different hydraulic conditions, are used. Then, the results of M5 model were also compared with genetic programming (GP) and pervious empirical results to investigate the applicability, ability, and accuracy of these procedures. To examine the accuracy of the results yielded from the M5 and GP procedures, two performance indicators (determination coefficient (R2) and root mean square error (RMSE)) were used. The comparison test of results clearly shows that the implementation of M5 technique sounds satisfactory regarding the performance indicators (R<sup>2</sup> = 0.944 and RMSE = 0.126) with less deviation from the numerical values. In addition, M5 tree model, by presenting relationships based on liner regression, has good capability to estimate the depth of scour abutment for engineers in practical terms.展开更多
The ongoing effort to create methods for detecting and quantifying fatigue damage is motivated by the high levels of uncertainty in present fatigue-life prediction approaches and the frequently catastrophic nature of ...The ongoing effort to create methods for detecting and quantifying fatigue damage is motivated by the high levels of uncertainty in present fatigue-life prediction approaches and the frequently catastrophic nature of fatigue failure.The fatigue life of high strength aluminum alloy 2090-T83 is predicted in this study using a variety of artificial intelligence and machine learning techniques for constant amplitude and negative stress ratios(R?1).Artificial neural networks(ANN),adaptive neuro-fuzzy inference systems(ANFIS),support-vector machines(SVM),a random forest model(RF),and an extreme-gradient tree-boosting model(XGB)are trained using numerical and experimental input data obtained from fatigue tests based on a relatively low number of stress measurements.In particular,the coefficients of the traditional force law formula are found using relevant numerical methods.It is shown that,in comparison to traditional approaches,the neural network and neuro-fuzzy models produce better results,with the neural network models trained using the boosting iterations technique providing the best performances.Building strong models from weak models,XGB helps to predict fatigue life by reducing model partiality and variation in supervised learning.Fuzzy neural models can be used to predict the fatigue life of alloys more accurately than neural networks and traditional methods.展开更多
The dead fuel moisture content(DFMC)is the key driver leading to fire occurrence.Accurately estimating the DFMC could help identify locations facing fire risks,prioritise areas for fire monitoring,and facilitate timel...The dead fuel moisture content(DFMC)is the key driver leading to fire occurrence.Accurately estimating the DFMC could help identify locations facing fire risks,prioritise areas for fire monitoring,and facilitate timely deployment of fire-suppression resources.In this study,the DFMC and environmental variables,including air temperature,relative humidity,wind speed,solar radiation,rainfall,atmospheric pressure,soil temperature,and soil humidity,were simultaneously measured in a grassland of Ergun City,Inner Mongolia Autonomous Region of China in 2021.We chose three regression models,i.e.,random forest(RF)model,extreme gradient boosting(XGB)model,and boosted regression tree(BRT)model,to model the seasonal DFMC according to the data collected.To ensure accuracy,we added time-lag variables of 3 d to the models.The results showed that the RF model had the best fitting effect with an R2value of 0.847 and a prediction accuracy with a mean absolute error score of 4.764%among the three models.The accuracies of the models in spring and autumn were higher than those in the other two seasons.In addition,different seasons had different key influencing factors,and the degree of influence of these factors on the DFMC changed with time lags.Moreover,time-lag variables within 44 h clearly improved the fitting effect and prediction accuracy,indicating that environmental conditions within approximately 48 h greatly influence the DFMC.This study highlights the importance of considering 48 h time-lagged variables when predicting the DFMC of grassland fuels and mapping grassland fire risks based on the DFMC to help locate high-priority areas for grassland fire monitoring and prevention.展开更多
基金This research work was funded by Institutional Fund Projects under Grant No.(IFPIP:211-611-1443).
文摘Malware is an ever-present and dynamic threat to networks and computer systems in cybersecurity,and because of its complexity and evasiveness,it is challenging to identify using traditional signature-based detection approaches.The study article discusses the growing danger to cybersecurity that malware hidden in PDF files poses,highlighting the shortcomings of conventional detection techniques and the difficulties presented by adversarial methodologies.The article presents a new method that improves PDF virus detection by using document analysis and a Logistic Model Tree.Using a dataset from the Canadian Institute for Cybersecurity,a comparative analysis is carried out with well-known machine learning models,such as Credal Decision Tree,Naïve Bayes,Average One Dependency Estimator,Locally Weighted Learning,and Stochastic Gradient Descent.Beyond traditional structural and JavaScript-centric PDF analysis,the research makes a substantial contribution to the area by boosting precision and resilience in malware detection.The use of Logistic Model Tree,a thorough feature selection approach,and increased focus on PDF file attributes all contribute to the efficiency of PDF virus detection.The paper emphasizes Logistic Model Tree’s critical role in tackling increasing cybersecurity threats and proposes a viable answer to practical issues in the sector.The results reveal that the Logistic Model Tree is superior,with improved accuracy of 97.46%when compared to benchmark models,demonstrating its usefulness in addressing the ever-changing threat landscape.
文摘The North China Plain and the agricultural region are crossed by the Shanxi-Beijing natural gas pipeline.Resi-dents in the area use rototillers for planting and harvesting;however,the depth of the rototillers into the ground is greater than the depth of the pipeline,posing a significant threat to the safe operation of the pipeline.Therefore,it is of great significance to study the dynamic response of rotary tillers impacting pipelines to ensure the safe opera-tion of pipelines.This article focuses on the Shanxi-Beijing natural gas pipeline,utilizingfinite element simulation software to establish afinite element model for the interaction among the machinery,pipeline,and soil,and ana-lyzing the dynamic response of the pipeline.At the same time,a decision tree model is introduced to classify the damage of pipelines under different working conditions,and the boundary value and importance of each influen-cing factor on pipeline damage are derived.Considering the actual conditions in the hemp yam planting area,targeted management measures have been proposed to ensure the operational safety of the Shanxi-Beijing natural gas pipeline in this region.
文摘BACKGROUND Development of distant metastasis(DM)is a major concern during treatment of nasopharyngeal carcinoma(NPC).However,studies have demonstrated im-proved distant control and survival in patients with advanced NPC with the addition of chemotherapy to concomitant chemoradiotherapy.Therefore,precise prediction of metastasis in patients with NPC is crucial.AIM To develop a predictive model for metastasis in NPC using detailed magnetic resonance imaging(MRI)reports.METHODS This retrospective study included 792 patients with non-distant metastatic NPC.A total of 469 imaging variables were obtained from detailed MRI reports.Data were stratified and randomly split into training(50%)and testing sets.Gradient boosting tree(GBT)models were built and used to select variables for predicting DM.A full model comprising all variables and a reduced model with the top-five variables were built.Model performance was assessed by area under the curve(AUC).RESULTS Among the 792 patients,94 developed DM during follow-up.The number of metastatic cervical nodes(30.9%),tumor invasion in the posterior half of the nasal cavity(9.7%),two sides of the pharyngeal recess(6.2%),tubal torus(3.3%),and single side of the parapharyngeal space(2.7%)were the top-five contributors for predicting DM,based on their relative importance in GBT models.The testing AUC of the full model was 0.75(95%confidence interval[CI]:0.69-0.82).The testing AUC of the reduced model was 0.75(95%CI:0.68-0.82).For the whole dataset,the full(AUC=0.76,95%CI:0.72-0.82)and reduced models(AUC=0.76,95%CI:0.71-0.81)outperformed the tumor node-staging system(AUC=0.67,95%CI:0.61-0.73).CONCLUSION The GBT model outperformed the tumor node-staging system in predicting metastasis in NPC.The number of metastatic cervical nodes was identified as the principal contributing variable.
基金This study was funded by the National Natural Science Foundation of China(Grant No.41975027)the Natural Science Foundation of Jiangsu Province(Grant No.BK20171457)the National Key R&D Program on Monitoring,Early Warning and Prevention of Major Natural Disasters(Grant No.2017YFC1501401).
文摘With the increasing availability of precipitation radar data from space,enhancement of the resolution of spaceborne precipitation observations is important,particularly for hazard prediction and climate modeling at local scales relevant to extreme precipitation intensities and gradients.In this paper,the statistical characteristics of radar precipitation reflectivity data are studied and modeled using a hidden Markov tree(HMT)in the wavelet domain.Then,a high-resolution interpolation algorithm is proposed for spaceborne radar reflectivity using the HMT model as prior information.Owing to the small and transient storm elements embedded in the larger and slowly varying elements,the radar precipitation data exhibit distinct multiscale statistical properties,including a non-Gaussian structure and scale-to-scale dependency.An HMT model can capture well the statistical properties of radar precipitation,where the wavelet coefficients in each sub-band are characterized as a Gaussian mixture model(GMM),and the wavelet coefficients from the coarse scale to fine scale are described using a multiscale Markov process.The state probabilities of the GMM are determined using the expectation maximization method,and other parameters,for instance,the variance decay parameters in the HMT model are learned and estimated from high-resolution ground radar reflectivity images.Using the prior model,the wavelet coefficients at finer scales are estimated using local Wiener filtering.The interpolation algorithm is validated using data from the precipitation radar onboard the Tropical Rainfall Measurement Mission satellite,and the reconstructed results are found to be able to enhance the spatial resolution while optimally reproducing the local extremes and gradients.
基金provided by National Science Foundation Center for Advanced Forestry Systems(CAFSAward#1915078)RII Track-2FEC(Award#1920908)。
文摘Tree mortality plays a fundamental role in the dynamics of forest ecosystems,yet it is one of the most difficult phenomena to accurately predict.Various modeling strategies have been developed to improve individual tree mortality predictions.One less explored strategy is the use of a multistage modeling approach.Potential improvements from this approach have remained largely unknown.In this study,we developed a novel multistage approach and compared its performance in individual tree mortality predictions with a more conventional approach using an identical individual tree mortality model formulation.Extensive permanent plot data(n=9442)covering the Acadian Region of North America and over multiple decades(1965–2014)were used in this study.Our results indicated that the model behavior with the multistage approach better depicted the observed mortality and showed a notable improvement over the conventional approach.The difference between the observed and predicted numbers of dead trees using the multistage approach was much smaller when compared with the conventional approach.In addition,tree survival probabilities predicted by the multistage approach generally were not significantly different from the observations,whereas the conventional approach consistently underestimated mortality across species and overestimated tree survival probabilities over the large range of DBH in the data.The new multistage approach also predictions of zero mortality in individual plots,a result not possible in conventional models.Finally,the new approach was more tolerant of modeling errors because it based estimates on ranked tree mortality rather than error-prone predicted values.Overall,this new multistage approach deserves to be considered and tested in future studies.
基金grants from the National Natural Science Foundation of China(No.31870620)the Fundamental Research Funds for the Central Universities(No.PTYX202107)the National Technology Extension Fund of Forestry([2019]06)。
文摘Digital aerial photograph(DAP)data is processed based on Structure from Motion(Sf M)algorithm and regional net adjustment method to generate digital surface discrete point clouds similar to Light Detection and Ranging(LiDAR)and digital orthophoto mosaic(DOM)similar to optical remote sensing image.In this study,we obtained highresolution images of mature forests of Chinese fir by unmanned aerial vehicle(UAV)flying through crossroute flight,and then reconstructed the threedimensional point clouds in the UAV aerial area by SfM technique.The point cloud segmentation(PCS)algorithm was used for the individual tree segmentation,and the F-score of the three sample plots were 0.91,0.94,and 0.94,respectively.Individual tree biomass modeling was conducted using 155 mature Chinese fir forests which were correctly segmented.The relative root mean squared error(rRMSE)values of random forest(RF),bagged tree(BT)and support vector regression(SVR)were 34.48%,35.74%and 40.93%,respectively.Our study demonstrated that DAP point clouds had great potential to extract forest vertical parameters and could be applied successfully in individual tree segmentation and individual tree biomass modeling.
基金supported by a grant from the Education Department of Zhejiang Province (No.Y200803235)
文摘Objective: The progression of human cancer is characterized by the accumulation of genetic instability. An increasing number of experimental genetic molecular techniques have been used to detect chromosome aberrations. Previous studies on chromosome abnormalities often focused on identifying the frequent loci of chromosome alterations, but rarely addressed the issue of interrelationship of chromosomal abnormalities. In the last few years, several mathematical models have been employed to construct models of carcinogenesis, in an attempt to identify the time order and cause-and-effect relationship of chromosome aberrations. The principles and applications of these models are reviewed and compared in this paper. Mathematical modeling of carcinogenesis can contribute to our understanding of the molecular genetics of tumor development, and identification of cancer related genes, thus leading to improved clinical practice of cancer.
文摘Tyre pressure monitoring system(TPMS)is compulsory in most countries like the United States and European Union.The existing systems depend on pressure sensors strapped on the tyre or on wheel speed sensor data.A difference in wheel speed would trigger an alarm based on the algorithm implemented.In this paper,machine learning approach is proposed as a new method to monitor tyre pressure by extracting the vertical vibrations from a wheel hub of a moving vehicle using an accelerometer.The obtained signals will be used to compute through statistical features and histogram features for the feature extraction process.The LMT(Logistic Model Tree)was used as the classifier and attained a classification accuracy of 92.5%with 10-fold cross validation for statistical features and 90.5% with 10-fold cross validation for histogram features.The proposed model can be used for monitoring the automobile tyre pressure successfully.
基金support from Kongju National University Research Grant (2014)
文摘This study was conducted to evaluate the performance of six stem taper models on four tropical tree species, namely Celtis luzonica(Magabuyo),Diplodiscus paniculatus(Balobo), Parashorea malaanonan(Bagtikan), and Swietenia macrophylla(Mahogany) in Mount Makiling Forest Reserve(MMFR), Philippines using fit statistics and lack-of-fit statistics. Four statistical criteria were used in this study, including the standard error of estimate(SEE),coefficient of determination(R^2), mean bias( E),and absolute mean difference(AMD). For the lack-offit statistics, SEE, E and AMD were determined in different relative height classes. The results indicated that the Kozak02 stem taper model offered the best fit for the four tropical species in most statistics. The Kozak02 model also consistently provided the best performance in the lack-of-fit statistics with the best SEE, E and AMD in most of the relative height classes. These stem taper equations could help forest managers and researchers better estimate the diameter of the outside bark with any given height,merchantable stem volumes and total stem volumes of standing trees belonging to the four species of thetropical forest in MMFR.
基金Science and Technology Project of Guangzhou, No. 2002Z3-E4016 No. B30101, China
文摘AIM: To construct tree models for classification of diffuse large B-cell lymphomas (DLBCL) by chromosome copy numbers, to compare them with cDNA microarray classification, and to explore models of multi-gene, multi-step and multi-pathway processes of DLBCL tumorigenesis. METHODS: Maximum-weight branching and distancebased models were constructed based on the comparative genomic hybridization (CGH) data of 123 DLBCL samples using the established methods and software of Desper et al . A maximum likelihood tree model was also used to analyze the data. By comparing with the results reported in literature, values of tree models in the classification of DLBCL were elucidated. RESULTS: Both the branching and the distance-based trees classified DLBCL into three groups. We combined the classification methods of the two models and classified DLBCL into three categories according to their characteristics. The first group was marked by +Xq, +Xp, -17p and +13q; the second group by +3q, +18q and +18p; and the third group was marked by -6q and +6p. This chromosomal classification was consistent with cDNA classification. It indicated that -6q and +3q were two main events in the tumorigenesis of lymphoma. CONCLUSION: Tree models of lymphoma established from CGH data can be used in the classification of DLBCL. These models can suggest multi-gene, multistep and multi-pathway processes of tumorigenesis. Two pathways, -6q preceding +6q and +3q preceding+18q, may be important in understanding tumorigenesis of DLBCL. The pathway, -6q preceding +6q, may have a close relationship with the tumorigenesis of non-GCB DLBCL.
文摘The artificial pure and mixed Korean pine (Pinus koraiensis) forests were investigated at Dailing Forestry Bureau in Xiaoxing'an mountains from 1990 to 1992. Depending on the distance between the samplings of Korean pine and their neighbor trees, the neighbor tree height, the size of neighbor tree canopy, and dimension of neighbor tree. The forest structure was classified into three types: (1) prowth of a tree in the light (open), (2) Growth of a tree in the canopy gap (Gap), (3)Growth of a tree under broad-leaved tree canopy. The frequeney, height, and age of stem divergence of Korean pine tree were investigated by sampling trees. The temporal and spatial model of the tree growth was applied on basis of the height of stem divergence, ratio of height and DBH, and character of tree stem.The morphology and growth character of Korean pine trees during different development stage were forecasted.
文摘Background: A novel approach to modelling individual tree growth dynamics is proposed. The approach combines multiple imputation and copula sampling to produce a stochastic individual tree growth and yield projection system. Methods: The Nova Scotia, Canada permanent sample plot network is used as a case study to develop and test the modelling approach. Predictions from this model are compared to predictions from the Acadian variant of the Forest Vegetation Simulator, a widely used statistical individual tree growth and yield model. Results: Diameter and height growth rates were predicted with error rates consistent with those produced using statistical models. Mortality and ingrowth error rates were higher than those observed for diameter and height, but also were within the bounds produced by traditional approaches for predicting these rates. Ingrowth species composition was very poorly predicted. The model was capable of reproducing a wide range of stand dynamic trajectories and in some cases reproduced trajectories that the statistical model was incapable of reproducing. Conclusions: The model has potential to be used as a benchmarking tool for evaluating statistical and process models and may provide a mechanism to separate signal from noise and improve our ability to analyze and learn from large regional datasets that often have underlying flaws in sample design.
文摘In order to effectively avoid the defects of a traditional discounted cash flow method, a trinomial tree pricing model of the real option is improved and used to forecast the investment price of mining. Taking Molybdenum ore as an example, a theoretical model for the hurdle price under the optimal investment timing is constructed. Based on the example data, the op- tion price model is simulated. By the model, mine investment price can be computed and forecast effectively. According to the characteristics of mine investment, cut-off grade, reserve estimation and mine life in different price also can be quantified. The result shows that it is reliable and practical to enhance the accuracy for mining investment decision.
文摘Background: Growth and yield models are important tools for forest planning. Due to its geographic location, topology, and history of management, the forests of the Adirondacks Region of New York are unique and complex. However, only a relatively limited number of growth and yield models have been developed and/or can be reasonably extended to this region currently. Methods: in this analysis, 571 long-term continuous forest inventory plots with a total of 10 - 52 years of measurement data from four experimental forests maintained by the State University of New York College of Environmental Science and Forestry and one nonindustrial private forest were used to develop an individual tree growth model for the primary hardwood and softwood species in the region. Species-specific annualized static and dynamic equations were developed using the available data and the system was evaluated for long-term behavior. Results: Equivalence tests indicated that the Northeast Variant of the Forest Vegetation Simulator (FVS-NE) was biased in its estimation of tree total and bole height, diameter and height increment, and mortality for most species examined. In contrast, the developed static and annualized dynamic, species-specific equations performed quite well given the underlying variability in the data. Long-term model projections were consistent with the data and suggest a relatively robust system for prediction. Conclusions: Overall, the developed growth model showed reasonable behavior and is a significant improvement over existing models for the region. The model also highlighted the complexities of forest dynamics in the region and should help improve forest planning efforts there.
文摘In the present article it will be critically questioned the traditional entrepreneurship education approaches based on a narrow conception of competency, and their values. Assuming the perspective that to be an entrepreneur is basically an attitude towards life and the world, there proposed holistic, constructivist and experiential processes and strategies for entrepreneurship education. The "entrepreneur XXI", must be able to undertake a social function of change, so, an economical and social development more human, ethical and intelligent. Under this assumption, the "Tree Model for the Development of Entrepreneurial Competencies", that will be discussed globally in the second part of this article, suggests a dynamic and experiential approach ofentrepreneurship education based on the qualification of people's behaviour, self-esteem, competencies and experiences; a profile of key behavioural and performance competencies (root), experimental pedagogical procedures (trunk) and real results within group projects (fruits). This model has been developed during the last decade (2001-2011), using a multidisciplinary research-action procedure, within business, education (at different teaching levels) and social project environments.
文摘A model of using binomial tree pricing formulae in a fuzzy market is proposed. In the fuzzy market, a price interval can be got according to the belief degree. The rule for the reasonability of the price interval is proposed. The explicit expression of the interval is discussed in some special settings.
文摘Scour is a natural phenomenon that is created by the rivers streams or the flood which brings about transferring or eroding of bed materials. To have accurate and safe erosion control structures design, maximum scour depth in downstream of the structures gains specific significance. In the current study, M5 model tree as remedy data mining approaches is suggested to estimate the scour depth around the abutments. To do this, Kayaturk laboratory data (2005), with different hydraulic conditions, are used. Then, the results of M5 model were also compared with genetic programming (GP) and pervious empirical results to investigate the applicability, ability, and accuracy of these procedures. To examine the accuracy of the results yielded from the M5 and GP procedures, two performance indicators (determination coefficient (R2) and root mean square error (RMSE)) were used. The comparison test of results clearly shows that the implementation of M5 technique sounds satisfactory regarding the performance indicators (R<sup>2</sup> = 0.944 and RMSE = 0.126) with less deviation from the numerical values. In addition, M5 tree model, by presenting relationships based on liner regression, has good capability to estimate the depth of scour abutment for engineers in practical terms.
文摘The ongoing effort to create methods for detecting and quantifying fatigue damage is motivated by the high levels of uncertainty in present fatigue-life prediction approaches and the frequently catastrophic nature of fatigue failure.The fatigue life of high strength aluminum alloy 2090-T83 is predicted in this study using a variety of artificial intelligence and machine learning techniques for constant amplitude and negative stress ratios(R?1).Artificial neural networks(ANN),adaptive neuro-fuzzy inference systems(ANFIS),support-vector machines(SVM),a random forest model(RF),and an extreme-gradient tree-boosting model(XGB)are trained using numerical and experimental input data obtained from fatigue tests based on a relatively low number of stress measurements.In particular,the coefficients of the traditional force law formula are found using relevant numerical methods.It is shown that,in comparison to traditional approaches,the neural network and neuro-fuzzy models produce better results,with the neural network models trained using the boosting iterations technique providing the best performances.Building strong models from weak models,XGB helps to predict fatigue life by reducing model partiality and variation in supervised learning.Fuzzy neural models can be used to predict the fatigue life of alloys more accurately than neural networks and traditional methods.
基金funded by the National Key Research and Development Program of China Strategic International Cooperation in Science and Technology Innovation Program (2018YFE0207800)the National Natural Science Foundation of China (31971483)。
文摘The dead fuel moisture content(DFMC)is the key driver leading to fire occurrence.Accurately estimating the DFMC could help identify locations facing fire risks,prioritise areas for fire monitoring,and facilitate timely deployment of fire-suppression resources.In this study,the DFMC and environmental variables,including air temperature,relative humidity,wind speed,solar radiation,rainfall,atmospheric pressure,soil temperature,and soil humidity,were simultaneously measured in a grassland of Ergun City,Inner Mongolia Autonomous Region of China in 2021.We chose three regression models,i.e.,random forest(RF)model,extreme gradient boosting(XGB)model,and boosted regression tree(BRT)model,to model the seasonal DFMC according to the data collected.To ensure accuracy,we added time-lag variables of 3 d to the models.The results showed that the RF model had the best fitting effect with an R2value of 0.847 and a prediction accuracy with a mean absolute error score of 4.764%among the three models.The accuracies of the models in spring and autumn were higher than those in the other two seasons.In addition,different seasons had different key influencing factors,and the degree of influence of these factors on the DFMC changed with time lags.Moreover,time-lag variables within 44 h clearly improved the fitting effect and prediction accuracy,indicating that environmental conditions within approximately 48 h greatly influence the DFMC.This study highlights the importance of considering 48 h time-lagged variables when predicting the DFMC of grassland fuels and mapping grassland fire risks based on the DFMC to help locate high-priority areas for grassland fire monitoring and prevention.