Slope stability prediction plays a significant role in landslide disaster prevention and mitigation.This paper’s reduced error pruning(REP)tree and random tree(RT)models are developed for slope stability evaluation a...Slope stability prediction plays a significant role in landslide disaster prevention and mitigation.This paper’s reduced error pruning(REP)tree and random tree(RT)models are developed for slope stability evaluation and meeting the high precision and rapidity requirements in slope engineering.The data set of this study includes five parameters,namely slope height,slope angle,cohesion,internal friction angle,and peak ground acceleration.The available data is split into two categories:training(75%)and test(25%)sets.The output of the RT and REP tree models is evaluated using performance measures including accuracy(Acc),Matthews correlation coefficient(Mcc),precision(Prec),recall(Rec),and F-score.The applications of the aforementionedmethods for predicting slope stability are compared to one another and recently established soft computing models in the literature.The analysis of the Acc together with Mcc,and F-score for the slope stability in the test set demonstrates that the RT achieved a better prediction performance with(Acc=97.1429%,Mcc=0.935,F-score for stable class=0.979 and for unstable case F-score=0.935)succeeded by the REP tree model with(Acc=95.4286%,Mcc=0.896,F-score stable class=0.967 and for unstable class F-score=0.923)for the slope stability dataset The analysis of performance measures for the slope stability dataset reveals that the RT model attains comparatively better and reliable results and thus should be encouraged in further research.展开更多
The epidemiology of HIV-1 varies in different areas of the world, and it is possible that this complexity may leave unique footprints in the viral genome. Thus, we attempted to find significant patterns in global HIV-...The epidemiology of HIV-1 varies in different areas of the world, and it is possible that this complexity may leave unique footprints in the viral genome. Thus, we attempted to find significant patterns in global HIV-1 genome sequences. By applying the rule inference algorithm RIPPER (Repeated Incremental Pruning to Produce Error Reduction) to multiple sequence alignments of Env sequences from four classes of compiled datasets, we generated four sets of signature patterns. We found that these patterns were able to distinguish southeastern Asian from non- southeastern Asian sequences with 97.5% accuracy, Chinese from non-Chinese sequences with 98.3% accuracy, African from non-African sequences with 88.4% accuracy, and southern African from non-southern African sequences with 91.2% accuracy. These patterns showed different associations with subtypes and with amino acid positions. In addition, some signature patterns were characteristic of the geographic area from which the sample was taken. Amino acid features corresponding to the phylogenetic clustering of HIV-1 sequences were consistent with some of the deduced patterns. Using a combination of patterns inferred from subtypes B, C, and all subtypes chimeric with CRF01_AE worldwide, we found that signature patterns of subtype C were extremely common in some sampled countries (for example, Zambia in southern Africa), which may hint at the origin of this HIV-1 subtype and the need to pay special attention to this area of Africa. Signature patterns of subtype B sequences were associated with different countries. Even more, there are distinct patterns at single position 21 with glycine, leucine and isoleucine corresponding to subtype C, B and all possible recombination forms chimeric with CRF01_AE, which also indicate distinct geographic features. Our method widens the scope of inference of signature from geographic, genetic, and genomic viewpoints. These findings may provide a valuable reference for epidemiological research or vaccine design.展开更多
基金supported by the National Key Research and Development Plan of China under Grant No.2021YFB2600703.
文摘Slope stability prediction plays a significant role in landslide disaster prevention and mitigation.This paper’s reduced error pruning(REP)tree and random tree(RT)models are developed for slope stability evaluation and meeting the high precision and rapidity requirements in slope engineering.The data set of this study includes five parameters,namely slope height,slope angle,cohesion,internal friction angle,and peak ground acceleration.The available data is split into two categories:training(75%)and test(25%)sets.The output of the RT and REP tree models is evaluated using performance measures including accuracy(Acc),Matthews correlation coefficient(Mcc),precision(Prec),recall(Rec),and F-score.The applications of the aforementionedmethods for predicting slope stability are compared to one another and recently established soft computing models in the literature.The analysis of the Acc together with Mcc,and F-score for the slope stability in the test set demonstrates that the RT achieved a better prediction performance with(Acc=97.1429%,Mcc=0.935,F-score for stable class=0.979 and for unstable case F-score=0.935)succeeded by the REP tree model with(Acc=95.4286%,Mcc=0.896,F-score stable class=0.967 and for unstable class F-score=0.923)for the slope stability dataset The analysis of performance measures for the slope stability dataset reveals that the RT model attains comparatively better and reliable results and thus should be encouraged in further research.
基金the funding by the Chinese Key National Science and Technology Program in the 12th Five-Year Period, grant 2012ZX10001006-002
文摘The epidemiology of HIV-1 varies in different areas of the world, and it is possible that this complexity may leave unique footprints in the viral genome. Thus, we attempted to find significant patterns in global HIV-1 genome sequences. By applying the rule inference algorithm RIPPER (Repeated Incremental Pruning to Produce Error Reduction) to multiple sequence alignments of Env sequences from four classes of compiled datasets, we generated four sets of signature patterns. We found that these patterns were able to distinguish southeastern Asian from non- southeastern Asian sequences with 97.5% accuracy, Chinese from non-Chinese sequences with 98.3% accuracy, African from non-African sequences with 88.4% accuracy, and southern African from non-southern African sequences with 91.2% accuracy. These patterns showed different associations with subtypes and with amino acid positions. In addition, some signature patterns were characteristic of the geographic area from which the sample was taken. Amino acid features corresponding to the phylogenetic clustering of HIV-1 sequences were consistent with some of the deduced patterns. Using a combination of patterns inferred from subtypes B, C, and all subtypes chimeric with CRF01_AE worldwide, we found that signature patterns of subtype C were extremely common in some sampled countries (for example, Zambia in southern Africa), which may hint at the origin of this HIV-1 subtype and the need to pay special attention to this area of Africa. Signature patterns of subtype B sequences were associated with different countries. Even more, there are distinct patterns at single position 21 with glycine, leucine and isoleucine corresponding to subtype C, B and all possible recombination forms chimeric with CRF01_AE, which also indicate distinct geographic features. Our method widens the scope of inference of signature from geographic, genetic, and genomic viewpoints. These findings may provide a valuable reference for epidemiological research or vaccine design.