A Brief Theoretical Overview of Random Forests (随机森林理论浅析)
Cited by: 142
Abstract: Random forests is a well-known ensemble learning method that is widely used for data classification and nonparametric regression. This paper reviews the main theoretical results on random forests in three parts: the convergence theorem, the generalization error bound, and out-of-bag estimation. It closes by presenting an improved random forests algorithm that uses feature-weighted subspace sampling to draw a subset of features at each node while growing trees. The new method is intended for classification problems on very high-dimensional data.
Source: Journal of Integration Technology (《集成技术》), 2013, No. 1, pp. 1-7 (7 pages)
Keywords: random forests; data mining; machine learning
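
The abstract describes sampling a subset of features at each node using feature weights. A minimal sketch of that idea follows, under stated assumptions: the function name `weighted_feature_subspace`, its parameters, and the example weights are hypothetical, and the abstract does not say how the weights are computed, so they are taken here as given non-negative scores. This is an illustration of weighted sampling without replacement, not the paper's algorithm.

```python
import random


def weighted_feature_subspace(weights, m, rng=None):
    """Draw m distinct feature indices, each draw proportional to its weight.

    Assumption: `weights` holds one non-negative score per feature
    (how the scores are obtained is not specified in the abstract).
    Features with zero weight are never selected.
    """
    rng = rng or random.Random()
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")

    # Only features with positive weight are eligible.
    candidates = [i for i, w in enumerate(weights) if w > 0]
    if m > len(candidates):
        raise ValueError("m exceeds the number of positively weighted features")

    cand_weights = [weights[i] for i in candidates]
    chosen = []
    for _ in range(m):
        # Pick one remaining candidate in proportion to its weight,
        # then remove it so it cannot be drawn again.
        pos = rng.choices(range(len(candidates)), weights=cand_weights, k=1)[0]
        chosen.append(candidates.pop(pos))
        cand_weights.pop(pos)
    return chosen


if __name__ == "__main__":
    # Hypothetical weights for five features; feature 1 is uninformative.
    example_weights = [0.5, 0.0, 2.0, 1.5, 0.1]
    print(weighted_feature_subspace(example_weights, 3, random.Random(0)))
```

In a tree learner, a call like this would replace the uniform feature draw at each node, so that features with higher weights are considered for splitting more often.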