基于KMUS-RF算法的复杂产品关键质量特性识别研究

Research on Identification of Critical-to-Quality Characteristics of Complex Products Based on KMUS-RF Algorithm

下载PDF

导出

摘要复杂产品生产数据具有高维度、不平衡的特点,为在复杂产品的生产阶段有效识别关键质量特性,及时进行质量控制,论文提出了一种基于聚类欠采样的改进随机森林算法(Random forest algorithm base on K-Means clustering under sampling,KMUS-RF),利用K-Means算法对多数样本进行聚类,并根据聚类结果进行多次欠采样形成多个平衡数据集,以随机森林为基分类器进行识别,最终根据分类过程中的特征重要性输出关键质量特性集。算例表明,KMUS-RF算法相比现有的多种分类器有良好的整体分类性能,并能显著降低复杂产品分类的第二类错误率,满足产品实际生产需求。 The production data of complex products have the characteristics of high dimension and imbalance. In order to effectively identify the critical-to-quality characteristics in the production stage of complex products and timely control the quality, this paper proposes an improved random forest algorithm base on K-Means clustering under sampling(KMUS-RF). K-Means algorithm is used to cluster the majority of samples, and multiple undersampling is performed according to the clustering results to form multiple balanced data sets. The random forest based classifier is used for recognition, and finally the critical-to-quality characteristics set is output according to the feature importance in the classification process. Numerical examples show that KMUS-RF algorithm has good overall classification performance compared with existing classifiers, and can significantly reduce the type Ⅱ error rate of complex product classification, and meet the actual production needs of products.

作者柳嘉昊 LIU Jia-hao(School of Management Science and Engineering,Nanjing University of Finance&Economics,Nanjing 210046,China)

机构地区南京财经大学管理科学与工程学院

出处《中小企业管理与科技》 2021年第30期134-137,共4页 Management & Technology of SME

基金江苏省研究生科研创新计划项目“基于数据挖掘的航空复杂装备产品关键质量特性识别研究”(项目编号:KYCX20_1354)。

关键词关键质量特性不平衡数据随机森林 K-MEANS 第二类错误 critical-to-quality characteristics imbalanced data random forest K-Means type Ⅱ error

分类号 F273.2 [经济管理—企业管理]

引文网络
相关文献

参考文献6

1李伯虎.复杂产品制造信息化的重要技术——复杂产品集成制造系统[J].中国制造业信息化（应用版）,2006(7):18-23. 被引量：25
2张健,方宏彬.剪枝与欠采样相结合的不平衡数据分类方法[J].计算机应用研究,2012,29(3):847-848. 被引量：4
3何益海,唐晓青,王美清.产品设计质量数据与管理模型研究[J].计算机集成制造系统,2006,12(8):1161-1166. 被引量：7
4闫伟,何桢,李岸达.基于CEM-IG算法的复杂产品关键质量特性识别[J].系统工程理论与实践,2014,34(5):1230-1236. 被引量：13
5于志忠.利用QFD方法建立基于顾客满意的质量目标[J].中国认证认可,2010(11):35-37. 被引量：1
6李岸达,何桢,何曙光.基于NSGA-Ⅱ的非平衡制造数据关键质量特性识别[J].系统工程理论与实践,2016,36(6):1472-1479. 被引量：8

二级参考文献57

1李伯虎.复杂产品制造信息化的重要技术——复杂产品集成制造系统[J].中国制造业信息化（应用版）,2006(7):18-23. 被引量：25
2WEISS G M. Mining with rarity:a unifying framework[ J]. SIGKDD Explorations ,2004,6( 1 ) :7-19.
3KUBAT M, MATWIN S. Addressing the curse of imbalanced training sets : one sided selection [ C ]//Proc of the 14th International Confe- rence on Machine Learning. 1997:179-186.
4YEN S J, LEE Y S. Cluster-based under-sampling approaches for im- balanced data distrlbutions[J]. Expert Systems with Applications, 2009,36(3 ) :5718-5727.
5JAPKOWICZ N. The class imbalance problem: significance and stra- tegies [ C ]//Proc of International Conference on Artificial Intelli- gence. 2000.
6JAPKOWICZ N. Concept-learning in the presence of between-class and within class imbalances[ C]//Proc of the 14th Conference of the Canadi- an Society for Computational Studies of Intelligence. 2001:67-77.
7CHAWLA N V ,BOWYER K W, HALL L O,et al. SMOTE:synthetic minority over-sampling technique [ J]. ,Journal of Artificial Intelli- gence Research,2002,16 ( 1 ) :321 - 357.
8CHAWLA N V t LAZAREVIC A, HALL.O. SMOTEBoost : improving prediction'of the minority class in boosting.[ C ]//Proc of the 7th Euro- pean Conference on Principles and Practice of Knowledge Discovery in Databases. Berlin : Springer,2003 : 107-119.
9张钹,张铃.问题求解理论及应用--商空间粒度计算理论及应用[M].2版.北京:清华大学出版社,2007.
10FREUND Y, SCHAPIRE R. Experiments with a new boosting algo- rithm[ C ]//Proc of the 13th International Conference on Machine Learning. 1996:148-156.

共引文献50

1王宁,闫娜,徐友真,杨剑锋.复杂多工序制造过程关键质量特性识别[J].统计与决策,2021(8):177-180. 被引量：6
2阎菲,李培良.汽车制造业动态供应链信息管理技术研究[J].中国管理信息化（综合版）,2007,10(5):4-7.
3阎菲,向郑涛,李培良.基于网格的企业价值生产控制Agent模型应用研究[J].机床与液压,2007,35(6):54-57.
4阎菲,李培良.基于网络扩展企业关键绩效管理控制模型研究[J].中国管理信息化（综合版）,2007,10(7):6-8.
5阎菲,陈刚.基于数据仓库汽车零部件失效诊断研究[J].微计算机信息,2007,23(05S):225-226.
6李新.复杂产品系统模型研究[J].合作经济与科技,2009(11):103-104. 被引量：10
7张根保,纪富义,任显林,葛红玉,张淑慧.复杂机电产品关键质量特性提取模型[J].重庆大学学报（自然科学版）,2010,33(2):8-14. 被引量：11
8闫伟,何桢,田文萌,何曙光.基于IG的复杂产品关键质量特性识别[J].工业工程与管理,2012,17(1):70-74. 被引量：9
9陈力姝.让小型飞机更安全[J].国外科技动态,2000(1):34-34.
10闫伟,何桢,田文萌.复杂产品关键质量特性识别方法[J].工业工程,2012,15(3):75-79. 被引量：3

1彭一川,李崇奕,王可,邢莹莹.基于权重的欠采样提升算法识别激进驾驶员[J].武汉理工大学学报（交通科学与工程版）,2021,45(2):195-201. 被引量：3
2李芳,郭璞.装配过程的智能感知研究[J].信息技术与信息化,2021(9):240-243.
3Ulagapriya Krishnan,Pushpa Sangar.A Rebalancing Framework for Classification of Imbalanced Medical Appointment No-show Data[J].Journal of Data and Information Science,2021,6(1):178-192.
4固体制剂工艺的创新已成大势所趋[J].流程工业,2021(9):3-3.
5QU Le’an,LI Manchun,CHEN Zhenjie,ZHI Junjun.A Modified Self-adaptive Method for Mapping Annual 30-m Land Use/Land Cover Using Google Earth Engine:A Case Study of Yangtze River Delta[J].Chinese Geographical Science,2021,31(5):782-794. 被引量：2
6周婉迪,裘雪玲,曾永玲,罗新元.水表在线校准及量值期间核查方法探讨[J].工业计量,2021,31(5):17-20. 被引量：1

中小企业管理与科技

2021年第30期

浏览历史

内容加载中请稍等...

基于KMUS-RF算法的复杂产品关键质量特性识别研究

参考文献6

二级参考文献57

共引文献50

相关作者

相关机构

相关主题

浏览历史