期刊文献+

基于自适应密度邻域关系的多标签在线流特征选择

Multi-label Online Stream Feature Selection Based on Adaptive Density Neighborhood Relation
下载PDF
导出
摘要 流特征选择指从以流形式到来的特征数据中选出最优特征子集,现有方法大多在模型训练中需要事先学习领域信息并预设给定参数值。实际应用中,由于不同的数据集数据结构和来源不同,在模型学习过程中研究人员无法提前获取相关领域知识且针对不同类型数据集指定一个统一参数存在巨大挑战。基于此,提出一种基于自适应密度邻域关系的多标签在线流特征选择方法(multi-label online stream feature selection based on adaptive density neighborhood relation,ML-OFS-ADNR),基于邻域粗糙集理论,所提方法在特征依赖计算时无需任何先验领域信息。此外,提出了一种新的自适应密度邻域关系,使用周围实例的密度信息,可以在流特征选择过程中自动选择适当数量的邻域,不需要事先指定任何参数。通过模糊等价约束,ML-OFS-ADNR可以选择高依赖低冗余度的特征。实验表明在10种不同类型的数据集上,所提方法在特征数量相同的情况下优于传统特征选择方法和先进的在线流特征选择方法。 Stream feature selection selects the optimal feature subset from the feature data arriving in the form of stream.Most existing methods require prior learning of domain information and presetting of given parameter values during model training.In real-world applications,due to the differences in data structure and source,researchers cannot obtain relevant domain information in advance during the model learning process for different datasets,and it is a huge challenge for them to specify a unified parameter for different types of datasets.Motivated by this,we propose a multi-label online stream feature selection based on adaptive density neighborhood relation(ML-OFS-ADNR).On the basis of the neighborhood rough set theory,the proposed method does not require any prior domain information in feature dependency calculation.Moreover,a new adaptive density neighborhood relationship is proposed,which can automatically select an appropriate number of neighborhoods in the streaming feature selection process using the density information of surrounding instances,and there is no need to specify any parameters in advance.By the fuzzy equal constraint,ML-OFS-ADNR can select features with high dependency and low redundancy.Experimental studies on ten different types of data sets show that the proposed method is superior to traditional feature selection methods with the same numbers of features and state-of-the-art online streaming feature selection algorithms in an online manner.
作者 张海翔 李培培 胡学钢 ZHANG Hai-xiang;LI Pei-pei;HU Xue-gang(Information Division,The Second People's Hospital of Hefei Affiliated to Bengbu Medical College,Hefei 230012,China;Key Laboratory of Knowledge Engineering with Big Data of Ministry of Education,Hefei University of Technology,Hefei 230601,China)
出处 《计算机技术与发展》 2024年第1期23-29,共7页 Computer Technology and Development
基金 国家自然科学基金资助项目(61976077,62076085,62120106008) 蚌埠医学院科技计划项目(2022byzd225sk)。
关键词 多标签分类 流特征 邻域粗糙集 自适应密度邻域 在线流特征选择 multi-label classification streaming feature neighborhood rough set adaptive density neighborhood online streaming feature selection
  • 相关文献

参考文献4

二级参考文献33

共引文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部