摘要
提出基于独立成分分析(ICA)和随机森林判别的Microarray分析方法。该方法先采用独立成分分析获取高阶统计信息,提取Microarray数据特征,达到降维的目的。再应用提取的特征,采用随机森林判别法对样本进行分类。数值分析结果表明,提取5个特征就可以使袋外样本OOB(out of bag)的分类错误率达到7.89%。该方法有效地降低了特征空间维数,具有较高的正确识别率,提高了算法的鲁棒性和灵活性。
In this paper, a microarray analysis method based on independent component analysis and random forests discriminant is provided. In this method the independent component analysis is used to obtain high order statistic information and extract features of microarray in order to reduce the dimension of the feature space. These features extracted were used to classify the samples of out - of -bag (OOB) by random forests discriminant. Numerical simulation shows that the classification error rates of OOB can be up to 7. 89% only by extracted five features. The method can reduce the dimension of feature space effectively and has higher correct classification rate. The results show that it improves robustness and flexibility of algorithms.
出处
《中国优生与遗传杂志》
2009年第8期8-10,共3页
Chinese Journal of Birth Health & Heredity
基金
北京市教育委员会科技发展计划面上项目(KM200910025006)