基于密度权期望最大与分裂合并策略的线状模式挖掘

Line Pattern Mining Based on Density Weight Expectation Maximization and Splitting Merging Strategy

下载PDF

导出

摘要该文针对非线性数据集中线状模式的挖掘问题,提出一种基于密度权期望最大(EM)与分裂合并策略的回归算法。算法基于有限混合模型思想,使用点向式方程定义线状模式表示,将网格密度作为调节权值引入EM过程,有效降低了回归落入局部极值的可能性。同时,引入分裂合并策略,使得算法能够解决连通性问题,并且即使在挖掘数设置与本质线状模式数不相符时也能获得正确结果。实验结果表明,算法对挖掘数设置不敏感,能够正确挖掘出噪声环境下数据集的线状模式。 To address the issue of line pattern mining of non-linear dataset,a new regression algorithm based on density weight Expectation Maximization（EM） and splitting merging strategy is proposed.Point-direction function is first employed to establish the expression of line pattern based on finite mixture model,and grid density is introduced into EM processing as adjust weight,which can effectively reduce the possibility of fall into local optimum of regression.Then a splitting merging strategy is introduced,which ensure the proposed algorithm can overcome the connectivity limitation,and can obtain a correct result even when the number of mining is not set as the same with the real line pattern number.Experiments demonstrate that the proposed algorithm is not sensitive to the set of mining number,and is able to correctly explore the line pattern of non-linear dataset under the noise environment.

作者王力吴成东陈东岳

机构地区东北大学信息科学与工程学院

出处《电子与信息学报》 EI CSCD 北大核心 2012年第5期1162-1167,共6页 Journal of Electronics & Information Technology

基金国家自然科学基金(61005032) 辽宁省自然科学基金(20102062) 沈阳市科学计划项目(F10-147-9-00) 中央高校基本科研业务费项目(N100604018)资助课题

关键词数据挖掘线状模式期望最大化网格密度分裂合并 Data mining Line pattern Expectation Maximization（EM） Grid density Splitting merging strategy

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献13

1Reddy C and Aziz M S.Modeling local nonlinear correlationsusing subspace principal curves[J].Statistical Analysis andData Mining,2010,3(5):332-349.
2Kumar S,Ong S H,Ranganath S,et al..Invariant textureclassification for biomedical cell specimens via non-linearpolar map filtering[J].Computer Vision and ImageUnderstanding,2010,114(1):44-53.
3Tong L and Hongbin Z.Riemannian manifold learning[J].IEEE Transactions on Pattern Analysis and MachineIntelligence,2008,30(5):796-809.
4Xiang S M,Nie F P,Pan C H,et al..Regressionreformulations of LLE and LTSA with locally lineartransformation[J].IEEE Transactions on Systems,Man andCybernetics,Part B,2011,41(5):1250-1262.
5Sun Y J,Todorovic S,and Goodison S.Local-learning-basedfeature selection for high-dimensional data analysis[J].IEEETransactions on Pattern Analysis and Machine Intelligence,2010,32(9):1610-1626.
6Donoho D L and Grimes C.Hessian eigenmaps:locally linearembedding techniques for high-dimensional data[J].Proceedings of the National Academy of Sciences of theUnited States of America,2003,100(10):5591-5596.
7Armstrong M A.Basic Topology[M].New York:Springer-Verlag,1997:43-51.
8马江洪,葛咏.图像线状模式的有限混合模型及其EM算法[J].计算机学报,2007,30(2):288-296. 被引量：12
9黎刚果,王正志,王晓敏,倪青山,强波.Linear manifold clustering for high dimensional data based on line manifold searching and fusing[J].Journal of Central South University,2010,17(5):1058-1069. 被引量：1
10Bilmes J.A gentle tutorial on the EM algorithm and itsapplication to parameter estimation for Gaussian mixtureand hidden Markov models[R].Technique Report,ICSI-TR-97-021,University of California,Berkeley,USA,1997.

二级参考文献30

1CHENG Y, CHURCH G M. Biclustering of expression data [C]// Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology. La Jolla, California: AAA1 Press, 2000:93 103.
2YANG J, WANG W, WANG H, YU E δ-clusters: Capturing subspace correlation in a large data set [C]// Proceedings of the 18th International Conference on Data Engineering. San Jose, CA: ICDE Press, 2002:517-528.
3HARPAZ R, HARALICK R. Exploiting the geometry of gene expression patterns for unsupervised learning [C]// Proceedings of the 18th International Conference on Pattern Recognition. Hong Kong: IEEE Computer Society Press, 2006: 670-674.
4HARALICK R, HARPAZ R. Linear manifold clustering in high dimensional spaces by stochastic search [J]. Pattern Recognition, 2007, 40(10): 2672-2684.
5DENG Hua, WU Yi-hu, DUAN Ji-an. Adaptive learning with guaranteed stability for discrete-time recurrent neural networks [J]. Journal of Central South University of Technology, 2007, 14(3): 685-690.
6KITTLER J, ILLINGWORTH J. Minimum error thresholding [J]. Pattern Recognition, 1986, 19:41-47.
7AEBERHARD S, COOMANS D, VEL O. The classification performance of RDA [R]. North Queensland: James Cook University of North Queensland, 1992:92 101.
8SHAPIRA M, SEGAL E, BOTSTEIN D. Disruption of yeast forkhead-associated cell cycle transcription by oxidative stress [J]. Mol Biol Cell, 2004, 15(12): 5659-5669.
9TROND B, BJARTE D, INGE J. LSimpute: Accurate estimation of missing values in microarray data with least squares methods [J]. Nucleic Acids Research, 2004, 32(3): e34.
10AGRAWAL R, GEHRKE J, GUNOPULOS D, RAGHAVAN E Automatic subspace clustering of high dimensional data [J]. Data Mining and Knowledge Discovery, 2005, 11(1): 5-33.

共引文献11

1徐志刚,赵祥模,宋焕生,雷涛,韦娜.基于直方图估计和形状分析的沥青路面裂缝识别算法[J].仪器仪表学报,2010,31(10):2260-2266. 被引量：64
2宋凯,郭纯宏,苏杭,周静,刘振环.实时人眼定位算法研究与设计[J].沈阳理工大学学报,2010,29(5):1-4.
3王力,吴成东,陈东岳,李孟歆,陈莉.非线性流形上的线性结构聚类挖掘[J].自动化学报,2012,38(8):1308-1320. 被引量：3
4管涛.统计聚类模型研究综述[J].计算机科学,2012,39(7):18-24. 被引量：7
5管涛,李玲玲.高斯混合模型、求解算法及视觉应用综述[J].中国图象图形学报,2012,17(12):1461-1471. 被引量：12
6肖澜岚.有关网络安全的态势感知系统研究[J].大观周刊,2013(8):345-345.
7钱丙益,帅斌,陈崇双,李静.基于混合回归模型的客运专线旅客市场细分研究[J].铁道运输与经济,2014,36(1):60-65. 被引量：8
8田杰,韩冬,胡秋霞,马孝义.基于PCA和高斯混合模型的小麦病害彩色图像分割[J].农业机械学报,2014,45(7):267-271. 被引量：23
9郭小芳,李锋,宋晓宁,王卫东.基于连续域混合蚁群优化的核模糊C-均值聚类算法研究[J].模式识别与人工智能,2014,27(9):841-846. 被引量：5
10李登朝,吴健,许凯.基于自适应高斯混合模型的遥感影像分类方法研究——以武汉地区遥感影像分类为例[J].资源环境与工程,2015,29(6):1014-1021.

1马江洪,葛咏.图像线状模式的有限混合模型及其EM算法[J].计算机学报,2007,30(2):288-296. 被引量：12
2姜静,曹彦.基于四叉树和特征融合的图像特征提取的研究[J].洛阳师范学院学报,2014,33(11):55-56. 被引量：3
3杜芳华,冀俊忠,赵学武,吴晨生.基于特征映射的半监督文本分类算法[J].北京工业大学学报,2016,42(2):230-235. 被引量：5
4黄猛,唐琳,胡世安,张搏.一种改进的分裂合并图像分割算法[J].现代电子技术,2009,32(22):102-105. 被引量：3
5叶楠,吕勇哉.模式识别在状态估计中的应用——一类软测量技术[J].仪器仪表学报,1988,9(4):368-374. 被引量：6
6黄宁宁,贾振红,余银峰,杨杰,庞韶宁.改进的FCM与局部信息相结合的图像分割[J].计算机应用与软件,2011,28(8):97-99. 被引量：4
7詹婧,刘新华,魏然.无线传感器/执行器网络的连通性与覆盖问题的研究进展[J].信息系统工程,2012,25(12):134-137.
8陈华.VirtualBox网络模式分析[J].机电信息,2010(18):26-26. 被引量：2
9文贡坚,王润生.一种有效的实现分裂合并算法的数据结构[J].中国图象图形学报（A辑）,1998,3(7):544-548.
10展益彬,林大钧,安琦.反馈型颜色分割在边缘检测中的应用[J].华东理工大学学报（自然科学版）,2010,36(1):146-149.

电子与信息学报

2012年第5期

浏览历史

内容加载中请稍等...

基于密度权期望最大与分裂合并策略的线状模式挖掘

参考文献13

二级参考文献30

共引文献11

相关作者

相关机构

相关主题

浏览历史