The identification of hepatitis C virus(HCV)virus-human protein interactions will not only help us understand the molecular mechanisms of related diseases but also be conductive to discovering new drug targets.An incr...The identification of hepatitis C virus(HCV)virus-human protein interactions will not only help us understand the molecular mechanisms of related diseases but also be conductive to discovering new drug targets.An increasing number of clinically and experimentally validated interactions between HCV and human proteins have been documented in public databases,facilitating studies based on computational methods.In this study,we proposed a new computational approach,rotation forest position-specific scoring matrix(RF-PSSM),to predict the interactions among HCV and human proteins.In particular,PSSM was used to characterize each protein,two-dimensional principal component analysis(2DPCA)was then adopted for feature extraction of PSSM.Finally,rotation forest(RF)was used to implement classification.The results of various ablation experiments show that on independent datasets,the accuracy and area under curve(AUC)value of RF-PSSM can reach 93.74% and 94.29%,respectively,outperforming almost all cutting-edge research.In addition,we used RF-PSSM to predict 9 human proteins that may interact with HCV protein E1,which can provide theoretical guidance for future experimental studies.展开更多
Background: Forest ecosystems are increasingly seen as multi-functional production systems, which should provide, besides timber and economic benefits, also other ecosystem services related to biological diversity, r...Background: Forest ecosystems are increasingly seen as multi-functional production systems, which should provide, besides timber and economic benefits, also other ecosystem services related to biological diversity, recreational uses and environmental functions of forests. This study analyzed the performance of even-aged rotation forest management (RFM), continuous cover forestry (CCF) and any-aged forestry (AAF) in the production of ecosystem services. AAF allows both even-aged and uneven-aged management schedules. The ecosystem services included in the analyses were net present value, volume of harvested timber, cowberry and bilberry yields, scenic value of the forest, carbon balance and suitability of the forest to Siberian jay. Methods: Data envelopment analysis was used to derive numerical efficiency ratios for the three management systems. Efficiency ratio is the sum of weighted outputs (ecosystem services) divided by the sum of weighted inputs. The linear programing model proposed by Charnes, Cooper and Rhodes was used to derive the weights for calculating efficiency scores for the silvicultural systems. Results and conclusions: CCF provided more ecosystem services than RFM, and CCF was more efficient than RFM and AAF in the production of ecosystem services. Multi-objective management provided more ecosystem services (except harvested timber) than single-objective management that maximized economic profitability. The use of low discount rate (resulting in low cutting level and high growing stock volume) led to better supply of most ecosystems services than the use of high discount rate. RFM where NPV was maximized with high discount rate led to particularly poor provision of most ecosystem services. In CCF the provision of ecosystem services was less sensitive to changes in discount rate and management objective than in RFM.展开更多
Acute leukemia is an aggressive disease that has high mortality rates worldwide.The error rate can be as high as 40%when classifying acute leukemia into its subtypes.So,there is an urgent need to support hematologists...Acute leukemia is an aggressive disease that has high mortality rates worldwide.The error rate can be as high as 40%when classifying acute leukemia into its subtypes.So,there is an urgent need to support hematologists during the classification process.More than two decades ago,researchers used microarray gene expression data to classify cancer and adopted acute leukemia as a test case.The high classification accuracy they achieved confirmed that it is possible to classify cancer subtypes using microarray gene expression data.Ensemble machine learning is an effective method that combines individual classifiers to classify new samples.Ensemble classifiers are recognized as powerful algorithms with numerous advantages over traditional classifiers.Over the past few decades,researchers have focused a great deal of attention on ensemble classifiers in a wide variety of fields,including but not limited to disease diagnosis,finance,bioinformatics,healthcare,manufacturing,and geography.This paper reviews the recent ensemble classifier approaches utilized for acute leukemia gene expression data classification.Moreover,a framework for classifying acute leukemia gene expression data is proposed.The pairwise correlation gene selection method and the Rotation Forest of Bayesian Networks are both used in this framework.Experimental outcomes show that the classification accuracy achieved by the acute leukemia ensemble classifiers constructed according to the suggested framework is good compared to the classification accuracy achieved in other studies.展开更多
文摘The identification of hepatitis C virus(HCV)virus-human protein interactions will not only help us understand the molecular mechanisms of related diseases but also be conductive to discovering new drug targets.An increasing number of clinically and experimentally validated interactions between HCV and human proteins have been documented in public databases,facilitating studies based on computational methods.In this study,we proposed a new computational approach,rotation forest position-specific scoring matrix(RF-PSSM),to predict the interactions among HCV and human proteins.In particular,PSSM was used to characterize each protein,two-dimensional principal component analysis(2DPCA)was then adopted for feature extraction of PSSM.Finally,rotation forest(RF)was used to implement classification.The results of various ablation experiments show that on independent datasets,the accuracy and area under curve(AUC)value of RF-PSSM can reach 93.74% and 94.29%,respectively,outperforming almost all cutting-edge research.In addition,we used RF-PSSM to predict 9 human proteins that may interact with HCV protein E1,which can provide theoretical guidance for future experimental studies.
文摘Background: Forest ecosystems are increasingly seen as multi-functional production systems, which should provide, besides timber and economic benefits, also other ecosystem services related to biological diversity, recreational uses and environmental functions of forests. This study analyzed the performance of even-aged rotation forest management (RFM), continuous cover forestry (CCF) and any-aged forestry (AAF) in the production of ecosystem services. AAF allows both even-aged and uneven-aged management schedules. The ecosystem services included in the analyses were net present value, volume of harvested timber, cowberry and bilberry yields, scenic value of the forest, carbon balance and suitability of the forest to Siberian jay. Methods: Data envelopment analysis was used to derive numerical efficiency ratios for the three management systems. Efficiency ratio is the sum of weighted outputs (ecosystem services) divided by the sum of weighted inputs. The linear programing model proposed by Charnes, Cooper and Rhodes was used to derive the weights for calculating efficiency scores for the silvicultural systems. Results and conclusions: CCF provided more ecosystem services than RFM, and CCF was more efficient than RFM and AAF in the production of ecosystem services. Multi-objective management provided more ecosystem services (except harvested timber) than single-objective management that maximized economic profitability. The use of low discount rate (resulting in low cutting level and high growing stock volume) led to better supply of most ecosystems services than the use of high discount rate. RFM where NPV was maximized with high discount rate led to particularly poor provision of most ecosystem services. In CCF the provision of ecosystem services was less sensitive to changes in discount rate and management objective than in RFM.
文摘Acute leukemia is an aggressive disease that has high mortality rates worldwide.The error rate can be as high as 40%when classifying acute leukemia into its subtypes.So,there is an urgent need to support hematologists during the classification process.More than two decades ago,researchers used microarray gene expression data to classify cancer and adopted acute leukemia as a test case.The high classification accuracy they achieved confirmed that it is possible to classify cancer subtypes using microarray gene expression data.Ensemble machine learning is an effective method that combines individual classifiers to classify new samples.Ensemble classifiers are recognized as powerful algorithms with numerous advantages over traditional classifiers.Over the past few decades,researchers have focused a great deal of attention on ensemble classifiers in a wide variety of fields,including but not limited to disease diagnosis,finance,bioinformatics,healthcare,manufacturing,and geography.This paper reviews the recent ensemble classifier approaches utilized for acute leukemia gene expression data classification.Moreover,a framework for classifying acute leukemia gene expression data is proposed.The pairwise correlation gene selection method and the Rotation Forest of Bayesian Networks are both used in this framework.Experimental outcomes show that the classification accuracy achieved by the acute leukemia ensemble classifiers constructed according to the suggested framework is good compared to the classification accuracy achieved in other studies.