期刊文献+

一种适合大规模数据集的特征选择方法 被引量:1

A Feature Selection Method Fitting for Large Data Set
下载PDF
导出
摘要 研究训练样本重要特征选择问题,提出了一种适合大规模数据集的特征选择方法。在不同的样本空间中利用特征相似性和浮动搜索方法的思想选择特征,基于互信息和分类准确度加权选择分类器,提出了基于Bagging选择性组合算法来提高特征选择算法稳定性。采用KDD Cup’99中的入侵检测数据对算法性能进行了验证。 This paper researches on problems of selecting important features and proposes a feature selection method fitting for large data set, selects feature subset using feature similarity, the idea of floating search method, and classifiers with the help of mutual information and accuracy weight, and propose a Bagging-based selective result ensemble algorithm to improve the algorithm stability. Intrusion detection data of KDD Cup'99 to validate the performance of algorithm is introduced.
作者 张莉 陈恭和
出处 《计算机工程》 CAS CSCD 北大核心 2007年第4期184-186,共3页 Computer Engineering
关键词 特征选择 特征相似性 浮动搜索 选择性集成 Feature selection Feature similarity Floating search Selective ensemble
  • 相关文献

参考文献9

  • 1Kwak N,Chong-Ho.Input Feature Selection for Classification Problems[J].IEEE Transaction on Neural Network,2002,13(1):143-157.
  • 2Mitra P,Murthy C A,Pal S K.Unsupervised Feature Selection Using Feature Similarity[J].IEEE Trans.on Pattern Analysis and Machine Intelligence,2002,24(3):301-312.
  • 3Dash M.Feature Selection via Set Cover[C]//Proceedings of the 1997IEEE Knowledge and Data Engineering Exchange Workshop.1997:165-171.
  • 4Zhang Li,Sun Gang,Guo Jun.Feature Selection for Pattern Classification Problem[C]//Proc.of the 4^th International Conference on Computer and Information Technology.2004:233-237.
  • 5Kononenko.Estimating Attributes:Analysis and Extension of Relief[C]//Proceedings of the European Conference on Machine Learning.1994:171-182.
  • 6Setiono R,Liu H.Neural Network Feature Selector[J].IEEE Tran.on Neural Network,1997,18(3):654-662.
  • 7Pudil P,Novovicova J,Kittler J.Floating Search Methods in Feature Selection[J].Pattern Recognition Letters,1994,15(11):1119-1125.
  • 8Strehl A,Ghosh J.Cluster Ensembles-A Knowledge Reuse Framework for Combining Partitioning[C]//Proc.of the 18^th National Conf.on Artificial Intelligence.Menlo Park:AAAI Press,2002:93-98.
  • 9唐伟,周志华.基于Bagging的选择性聚类集成[J].软件学报,2005,16(4):496-502. 被引量:95

二级参考文献14

  • 1Estivill-Castro V. Why so many clustering algorithms-A position paper. SIGKDD Explorations, 2002,4(1):65-75.
  • 2Dietterich TG. Machine learning research: Four current directions. AI Magazine, 1997,18(4):97-136.
  • 3Breiman L. Bagging predicators. Machine Learning, 1996,24(2):123-140.
  • 4Zhou ZH, Wu J, Tang W. Ensembling neural networks: Many could be better than all. Artificial Intelligence, 2002,137(1-2):239-263.
  • 5Strehl A, Ghosh J. Cluster ensembles-A knowledge reuse framework for combining partitionings. In: Dechter R, Kearns M,Sutton R, eds. Proc. of the 18th National Conf. on Artificial Intelligence. Menlo Park: AAAI Press, 2002. 93-98.
  • 6MacQueen JB. Some methods for classification and analysis of multivariate observations. In: LeCam LM, Neyman J, eds. Proc. of the 5th Berkeley Symp. on Mathematical Statistics and Probability. Berkeley: University of California Press, 1967,1:281-297.
  • 7Blake C, Keogh E, Merz CJ. UCI Repository of machine learning databases. Irvine: Department of Information and Computer Science, University of California, 1998. http://www.ics.uci.edu/~mlearn/MLRepository.html
  • 8Modha DS, Spangler WS. Feature weighting in k-means clustering. Machine Learning, 2003,52(3):217-237.
  • 9Zhou ZH, Tang W. Clusterer ensemble. Technical Report, Nanjing: AI Lab., Department of Computer Science & Technology,Nanjing University, 2002.
  • 10Fern XZ, Brodley CE. Random projection for high dimensional data clustering: A cluster ensemble approach. In: Fawcett T, Mishra N, eds. Proc. of the 20th Int'l Conf. on Machine Learning. Menlo Park: AAAI Press, 2003. 186-193.

共引文献94

同被引文献12

  • 1MOUELHI-CHIBANI W, PIERREVAL H. Training a neural net- work to select dispatching rules in real time [J]. Computers & In- dustrial Engineering, 2010, 58(2): 249 - 256.
  • 2PRIORE P, GOMEZ A, PINO R, et al. Dynamic scheduling of man- ufacturing systems using machine learning: An updated review [J]. Artificial Intelligence for Engineering Design, Analysis and Manu- facturing, 2014, 8(1): 83 - 97.
  • 3KIM Y D, SHIM S O, CHOI B, et al. Simplification methods for ac- celerating simulation-based real-time scheduling in a semiconductor wafer fabrication facility [J]. IEEE Transactions on Semiconductor Manufacturing, 2003, 16(2): 290- 298.
  • 4PARK Y J, HWANG H R. A rule-based simulation approach to scheduling problem in semiconductor photolithography process [C] //The 2013 8th International Conference on Intelligent Systems: Theories and Applications (SITA). Morocco: IEEE, 2013:1 - 4.
  • 5SHIUE Y R, GUH R S, TSENG T Y. Study on shop floor control system in semiconductor fabrication by self-organizing map-based intelligent multi-controller [J]. Computers & Industrial Engineer- ing, 2012, 62(4): 1119- 1129.
  • 6TSAI C J, HUANG H P. A real-time scheduling and rescheduling system based on RFID for semiconductor foundry FABs [J]. Jour- nal of the Chinese Institute of Industrial Engineers, 2007, 24(6): 437 - 445.
  • 7OLAFSSON S, LI X N. Learning effective new single machine dis- patching rules from optimal scheduling data [J]. International Jour- nal of Production Economics, 2010, 128(1): 118 - 126.
  • 8LI L, SUN Z J, NI J C, et al. Data-based scheduling framework and adaptive dispatching rule of complex manufacturing systems [J].The International Journal of Advanced Manufacturing Technology, 2013, 66(9/10/11/12): 1891 - 1905.
  • 9QIAO F, MAY M, GU X. Attribute selection algorithm of data- based scheduling strategy for semiconductor manufacturing [C] //IEEE International Conference on Automation Science and Engi- neering. USA: IEEE, 2013, 8: 410- 413.
  • 10MAY M, CHEN X, QIAO F, et al. The research and application of a dynamic dispatching strategy selection approach based on bpso- SVM for semiconductor production line [C] //The 11th IEEE In- ternational Conference on Networking, Sensing and Control (1C- NSC2014). Miami Florida, USA: IEEE, 2014, 4:74 - 79.

引证文献1

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部