期刊文献+

基于统计机器学习的高维复杂系统建模研究 被引量:3

Research on Regression Modeling of High-Dimensional Complex Systems Based on Statistical Machine Learning
下载PDF
导出
摘要 为了解决"维数灾难"给高维复杂系统数据驱动建模带来的过拟合、计算复杂度高等问题,探究特征提取算法对回归结果的影响。首先使用主成分分析法、非负矩阵分解法、局部线性嵌入法和均匀流形近似与投影法分别进行降维,提取关键特征后在多项式模型与随机响应面模型中进行回归,最后在草炭土土壤和电力系统的数据集上进行了仿真建模分析。仿真结果表明,在草炭土土壤中使用主成分分析法降维后的预测效果最好,在电力系统中使用非负矩阵分解法降维后的概率潮流回归结果最准确。经过分析后可知,主成分分析法适用于具有明显线性相关关系的高维数据集,基于流形的降维算法在本征维数较低的情况下不利于构建相似流形,提取关键特征的效果较差。 In order to solve the problems of over-fitting and high computational complexity brought by “dimensional disaster” to data-driven modeling of high-dimensional complex systems, the influence of feature extraction algorithm on regression results is explored. In this paper, the principal component analysis method, non-negative matrix factorization method, locally linear embedding method, uniform manifold approximation and projection method were used for dimensionality reduction respectively. After extracting the key features, regression was carried out in the polynomial model and stochastic response surface model. Finally, simulation modeling analysis was carried out on data sets of peat soil and power system. The simulation results show that the prediction effect of peat soil is the best after dimension reduction by principal component analysis, and the probability power flow regression is the most accurate in the power system after dimension reduction by nonnegative matrix decomposition. It can be concluded that principal component analysis is suitable for high-dimensional data sets with obvious linear correlation, and the dimensionality reduction algorithm based on manifolds is not conducive to constructing similar manifolds when the intrinsic dimensionality is low, and it is not good at extracting key features on the time.
作者 贾倩倩 郝梦泽 付学谦 JIA Qian-qian;HAO Meng-ze;FU Xue—qian(College of Information and Electrical Engineering,China Agricultural University,Beijing 100083,China)
出处 《计算机仿真》 北大核心 2022年第1期333-338,417,共7页 Computer Simulation
基金 需求侧多能互补优化与供需互动技术北京市重点实验室开放基金资助(YDB51202001976)。
关键词 维数灾难 回归 降维 机器学习 Dimensional disaster Regression Dimension reduction Machine learning
  • 相关文献

参考文献5

二级参考文献81

共引文献151

同被引文献26

引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部