期刊文献+

HB-PLS:A statistical method for identifying biological process or pathway regulators by integrating Huber loss and Berhu penalty with partial least squares regression 被引量:1

下载PDF
导出
摘要 Gene expression data features high dimensionality,multicollinearity,and non-Gaussian distribution noise,posing hurdles for identification of true regulatory genes controlling a biological process or pathway.In this study,we integrated the Huber loss function and the Berhu penalty(HB)into partial least squares(PLS)framework to deal with the high dimension and multicollinearity property of gene expression data,and developed a new method called HB-PLS regression to model the relationships between regulatory genes and pathway genes.To solve the Huber-Berhu optimization problem,an accelerated proximal gradient descent algorithm with at least 10 times faster than the general convex optimization solver(CVX),was developed.Application of HB-PLS to recognize pathway regulators of lignin biosynthesis and photosynthesis in Arabidopsis thaliana led to the identification of many known positive pathway regulators that had previously been experimentally validated.As compared to sparse partial least squares(SPLS)regression,an efficient method for variable selection and dimension reduction in handling multicollinearity,HB-PLS has higher efficacy in identifying more positive known regulators,a much higher but slightly less sensitivity/(1-specificity)in ranking the true positive known regulators to the top of the output regulatory gene lists for the two aforementioned pathways.In addition,each method could identify some unique regulators that cannot be identified by the other methods.Our results showed that the overall performance of HB-PLS slightly exceeds that of SPLS but both methods are instrumental for identifying real pathway regulators from high-throughput gene expression data,suggesting that integration of statistics,machine leaning and convex optimization can result in a method with high efficacy and is worth further exploration.
出处 《Forestry Research》 2021年第1期43-55,共13页 林学研究(英文)
基金 NSF Plant Genome Program[1703007 to SL and HW] NSF Advances in Biological Informatics[dbi-1458130 to HW] USDA McIntire-Stennis Fund to HW.
关键词 HUBER PENALTY slightly
  • 相关文献

同被引文献3

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部