Attribute selection based on granularity grouping and Pareto optimality


Abstract: Many attribute selection schemes hinge on using a given measure both as the attribute evaluation criterion and as the constraint of a heuristic algorithm. However, because attribute similarity is not evaluated and attributes are selected naively one at a time, attribute traversal is redundant and time consumption is high. In addition, a single measure limits the perspective of attribute evaluation, making it difficult to find attributes with high learning performance. To address this, an attribute selection framework is proposed in which: (1) attributes are grouped using attribute granularity and the knowledge distance between attributes, so that attributes within a group differ markedly while attributes across groups have strong discriminative ability; traversal then proceeds group by group, which compresses the search space of candidate attributes and improves selection efficiency; (2) attribute groups are selected iteratively using the proposed restricted Pareto optimality principle, finally yielding the desired attribute subset. Experiments on 12 UCI datasets with attribute noise injected at four different ratios show that, compared with 8 popular methods, the attribute selection results of the proposed method improve classification stability by 5.89% on average, improve classification accuracy by 12.28% on average, and reduce time consumption by 59.27% on average.
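The two ingredients named in the abstract, a knowledge distance between attributes and Pareto-based selection over several measures, can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration, not the authors' method: `knowledge_distance` uses one common partition-based definition from the granular computing literature (the mean normalized symmetric difference of per-sample granules) on discrete attributes, whereas the paper works with neighborhood rough sets; `pareto_front` implements plain, unrestricted Pareto dominance, since the abstract does not define the "restricted" variant; and the function names, toy columns, and scores are hypothetical.

```python
def granules(values):
    """For each sample, return the set of sample indices sharing its attribute value."""
    blocks = {}
    for i, v in enumerate(values):
        blocks.setdefault(v, set()).add(i)
    return [blocks[v] for v in values]


def knowledge_distance(col_a, col_b):
    """Assumed definition: average over samples of |G_a(x) ^ G_b(x)| / n,
    where ^ is the symmetric difference. 0 means identical partitions."""
    n = len(col_a)
    ga, gb = granules(col_a), granules(col_b)
    return sum(len(ga[i] ^ gb[i]) for i in range(n)) / (n * n)


def dominates(p, q):
    """p dominates q if p is no worse on every measure and better on at least one
    (all measures are treated as 'larger is better')."""
    return all(x >= y for x, y in zip(p, q)) and any(x > y for x, y in zip(p, q))


def pareto_front(scores):
    """Return the candidates that no other candidate dominates."""
    return [
        name
        for name, s in scores.items()
        if not any(dominates(t, s) for other, t in scores.items() if other != name)
    ]


# Toy discrete attributes over four samples (hypothetical data).
a = [0, 0, 1, 1]
b = [0, 0, 1, 1]   # induces the same partition as a
c = [0, 1, 0, 1]   # induces a different partition

print(knowledge_distance(a, b))  # identical partitions give distance 0.0
print(knowledge_distance(a, c))

# Hypothetical scores of three attribute groups under two measures.
group_scores = {"g1": (0.9, 0.2), "g2": (0.7, 0.6), "g3": (0.6, 0.5)}
print(pareto_front(group_scores))  # g3 is dominated by g2
```

Scoring each candidate under several measures and keeping only the non-dominated ones is what lets a selection step avoid committing to a single evaluation measure, which is the limitation the abstract points out.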

Authors: YIN Zhen-yu; WANG Ping-xin; YANG Xi-bei; YU Hua-long; QIAN Yu-hua (School of Computer, Jiangsu University of Science and Technology, Zhenjiang 212100, China; School of Science, Jiangsu University of Science and Technology, Zhenjiang 212100, China; School of Computer and Information Technology, Shanxi University, Taiyuan 030006, China)

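Affiliations: School of Computer Science and Engineering, Jiangsu University of Science and Technology; School of Science, Jiangsu University of Science and Technology; School of Computer and Information Technology, Shanxi University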

Source: Control and Decision (《控制与决策》), 2024, No. 9, pp. 2959-2968 (10 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.

Funding: National Natural Science Foundation of China (62076111); Postgraduate Practice Innovation Program of Jiangsu Province (SJCX22_1905).

Keywords: attribute selection; granularity; heuristic algorithm; heuristic information; neighborhood rough set; Pareto optimality

Classification code: TP182 [Automation and Computer Technology: Control Theory and Control Engineering]
