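Abstract: To address the tendency of chi-square automatic interaction detection (CHAID) decision trees to overfit, a CHAID random forest method (CHAID-RF) is proposed. The method combines random sampling, random feature selection, and ensembling, using CHAID decision trees as base classifiers to form CHAID-RF. To verify its effectiveness, CART, CHAID, SVM, and RF are selected as comparison algorithms. Accuracy, weighted precision, weighted recall, and weighted F-score serve as the evaluation metrics for classification models, and root mean square error serves as the metric for regression models. Validation is carried out on 10 classification datasets and 7 regression datasets. The experimental results show that CHAID-RF is feasible and effective.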
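The three ingredients named in this abstract (bootstrap sampling, a random feature subset per tree, and voting over the ensemble) can be sketched in a few lines. This is a minimal illustration, not the authors' implementation. It assumes categorical features and swaps each full CHAID tree for a one-level stump that splits on the candidate feature with the largest Pearson chi-square statistic; real CHAID also merges categories and uses adjusted p-values, which are omitted here. All function names, parameters, and the toy data are hypothetical.

```python
import random
from collections import Counter, defaultdict


def chi_square(rows, labels, feature):
    """Pearson chi-square statistic of the feature-by-class contingency table."""
    n = len(rows)
    joint = Counter((row[feature], y) for row, y in zip(rows, labels))
    feat_totals = Counter(row[feature] for row in rows)
    class_totals = Counter(labels)
    stat = 0.0
    for value, f_count in feat_totals.items():
        for cls, c_count in class_totals.items():
            expected = f_count * c_count / n
            stat += (joint[(value, cls)] - expected) ** 2 / expected
    return stat


def fit_stump(rows, labels, features):
    """Simplified base learner: split once on the feature with the largest chi-square."""
    best = max(features, key=lambda f: chi_square(rows, labels, f))
    by_value = defaultdict(Counter)
    for row, y in zip(rows, labels):
        by_value[row[best]][y] += 1
    # Each leaf predicts the majority class among the samples with that value.
    leaves = {v: c.most_common(1)[0][0] for v, c in by_value.items()}
    default = Counter(labels).most_common(1)[0][0]  # fallback for unseen values
    return best, leaves, default


def fit_forest(rows, labels, n_trees=25, n_features=2, seed=0):
    rng = random.Random(seed)
    all_features = list(range(len(rows[0])))
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]  # bootstrap sample
        feats = rng.sample(all_features, n_features)    # random feature subset
        forest.append(
            fit_stump([rows[i] for i in idx], [labels[i] for i in idx], feats)
        )
    return forest


def predict(forest, row):
    """Majority vote over the stumps."""
    votes = Counter(leaves.get(row[f], default) for f, leaves, default in forest)
    return votes.most_common(1)[0][0]


# Toy data (hypothetical): features 0 and 1 determine the class, feature 2 is noise.
rows = [("red", "round", n) for n in "xyxyxy"] + [("blue", "square", n) for n in "xyxyxy"]
labels = ["A"] * 6 + ["B"] * 6
forest = fit_forest(rows, labels)
```

Because every two-feature subset here contains at least one informative feature, the vote is dominated by stumps that split on it. With deeper base trees the same loop applies unchanged; only `fit_stump` would be replaced.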
Abstract: As the “engine” of continuous and repeated equipment operation, equipment maintenance support plays an increasingly prominent role in the confrontation of symmetrical combat systems. As the basis and guide for planning and implementing equipment maintenance tasks, equipment damage measurement is an important guarantee for the effective implementation of maintenance support. First, starting from the basic problem of wartime equipment damage measurement, this article comprehensively analyses the factors that influence damage measurement in terms of the enemy’s attributes, our own attributes, and the battlefield environment. Second, it determines the key factors using fuzzy comprehensive evaluation (FCE) and performs principal component analysis (PCA) on them. Finally, the principal components representing more than 85% of the data features are taken as the input and the equipment damage quantity as the output, and the data are trained and tested with an artificial neural network (ANN) and a random forest (RF). In summary, FCE-PCA-RF can serve as a reference for research on wartime equipment damage estimation.
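The step of keeping the principal components that represent more than 85% of the data features can be illustrated as follows. This sketch assumes that "data features" means cumulative explained variance and that the inputs are only mean-centred; the abstract does not say whether the factors are also standardized. The function name and the toy matrix are hypothetical.

```python
import numpy as np


def pca_by_variance(X, threshold=0.85):
    """Project X onto the fewest principal components whose cumulative
    explained-variance ratio reaches `threshold`."""
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)                      # centre each column
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    ratio = s**2 / np.sum(s**2)                  # explained-variance ratio per component
    # First index where the cumulative ratio reaches the threshold, plus one.
    k = int(np.searchsorted(np.cumsum(ratio), threshold)) + 1
    k = min(k, len(s))                           # guard against rounding at threshold = 1.0
    return Xc @ Vt[:k].T, ratio[:k]


# Toy example (hypothetical): three perfectly correlated factors collapse to one component.
t = np.arange(10.0)
scores, ratio = pca_by_variance(np.column_stack([t, 2 * t, -t]))
```

The returned `scores` would then play the role of the model input, with the damage quantity as the target for the ANN or RF regressor.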
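Abstract: Because the sintering process involves many uncertain factors, mechanism analysis and point predictions are not sufficiently reliable. A random forest-extreme tree-kernel density estimation (RF-ET-KDE) algorithm is therefore proposed for interval prediction of physical indicators (particle size and moisture). First, data preprocessing and feature selection are used to screen out the feature variables best suited to modeling. Second, a Stacking-based RF-ET algorithm is used for point prediction of the indicators, which gives the model high accuracy and good generalization. Then, the KDE algorithm is applied to the prediction errors of the indicators to obtain their distribution interval at a given confidence level, and from it the interval prediction. Finally, the proposed model is compared with other combined models. The results show that the RF-ET algorithm achieves good point prediction performance and that the KDE algorithm quantifies the indicator errors well, yielding highly reliable interval predictions.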
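The KDE step, which turns point-prediction errors into an interval at a chosen confidence level, might look like the sketch below. It is one plausible reading of the abstract, not the paper's procedure. It assumes errors are defined as actual minus predicted on held-out data, a Gaussian kernel, and a normal-reference rule-of-thumb bandwidth. The point predictor (the stacked RF-ET model) is out of scope here, and all names are hypothetical.

```python
import math
import statistics


def kde_cdf(x, errors, h):
    """CDF of a Gaussian KDE: the average of normal CDFs centred on each error."""
    return sum(
        0.5 * (1 + math.erf((x - e) / (h * math.sqrt(2)))) for e in errors
    ) / len(errors)


def kde_quantile(p, errors, h, tol=1e-9):
    """Invert the KDE CDF by bisection."""
    lo, hi = min(errors) - 10 * h, max(errors) + 10 * h
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if kde_cdf(mid, errors, h) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def prediction_interval(point_pred, errors, confidence=0.95):
    """Shift the point prediction by the lower and upper error quantiles."""
    # Rule-of-thumb bandwidth (an assumption; the abstract does not specify one).
    h = 1.06 * statistics.stdev(errors) * len(errors) ** (-0.2)
    alpha = 1 - confidence
    lower = kde_quantile(alpha / 2, errors, h)
    upper = kde_quantile(1 - alpha / 2, errors, h)
    return point_pred + lower, point_pred + upper


# Toy residuals (hypothetical) and a point prediction of 10.0.
errors = [-0.3, -0.2, -0.1, 0.0, 0.1, 0.2, 0.3]
low, high = prediction_interval(10.0, errors)
```

Because the interval is built from the empirical error distribution rather than a normality assumption, skewed residuals would produce an asymmetric interval around the point prediction.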