一种基于模特征的增量式张量Tucker分解方法

An incremental tensor tucker decomposition method based on model characteristics

导出

摘要随着数据量的爆炸式增长,边缘计算在大数据处理中的作用愈加重要.现实应用中产生的数据通常建模表示成高阶增量式张量的形式,增量式张量Tucker分解是一种高效挖掘高阶海量数据中隐藏信息的方法.针对传统增量式张量分解忽视张量模特征对分解过程的影响、分解结果不能较好保留原始数据特征的问题,提出一种基于模特征的增量式张量Tucker分解方法 ITTDMC (incremental tensor tucker decomposition based on mode characteristics).首先,用模长增量决定增量因子矩阵更新顺序,以此降低更新顺序带来的重构误差;其次,根据模熵变化比决定增量因子矩阵更新权重,使分解结果更准确保留各模特征;然后,将过往时刻的模特征和更新参数记录在指导张量中,遇到模特征相似的增量数据时直接使用来指导张量中参数的更新,避免重复计算,降低时间开销;最后,在合成和真实数据集上进行大量的实验,实验结果表明ITTDMC在模特征明显的数据集上能显著降低(最高可达29%)增量式张量的重构误差. With the explosive growth of data volume,edge computing plays an increasingly important role in big data processing.In general,the data generated by real applications is modelled and represented as high-order incremental tensors.Recently,the incremental tensor tucker decomposition is deemed as an efficient approach to extract the information inherent in those high-order massive data.As the traditional incremental tensor decomposition usually ignores the influence of tensor model characteristics on the decomposition process,it is rather difficult for the decomposition results to preserve the overall characteristics of the original data.To address these issues,we propose an incremental tensor tucker decomposition method ITTDMC(incremental tensor tucker decomposition based on mode characteristics)based on mode characteristics.First,the update order of the increment factor matrix is determined by the increase of mode length,for reducing the reconstruction error caused by the update order.Next,the update weight of the incremental factor matrix is computed according to the changing ratio of the mode entropy,such that the decomposition results enable to maintain the characteristics of each module more accurately.Furthermore,the previous model characteristics and update parameters are recorded in a guide tensor.When the incremental data with similar model characteristics needs to be processed,the corresponding update parameters of the guide tensor can be directly employed to reduce the computational costs.Extensive experiment results on the synthetic and real data sets exhibit that the ITTDMC can greatly reduce the reconstruction error of incremental tensor(up to 29%)for those data sets with strong model characteristics.

作者渠超洋韩建军 QU Chao-yang;HAN Jian-jun(School of Computer Science and Technology,Huazhong University of Science and Technology,Wuhan 430000,China)

机构地区华中科技大学计算机科学与技术学院

出处《控制与决策》 EI CSCD 北大核心 2024年第7期2431-2437,共7页 Control and Decision

关键词大数据增量式张量 Tucker分解模特征模长增量模熵 big data incremental tensor Tucker decomposition mode characteristics mode length increment mode entropy

分类号 TP301 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献4

1Ting Jia,Yuxia Yang,Xi Lu,Qiang Zhu,Kuo Yang,Xuezhong Zhou.Link Prediction based on Tensor Decomposition for the Knowledge Graph of COVID-19 Antiviral Drug[J].Data Intelligence,2022,4(1):134-148. 被引量：1
2朴勇,江贺,王秀坤.基于张量的XML相似度计算方法[J].控制与决策,2016,31(9):1711-1714. 被引量：2
3代伟,南静.随机权神经网络增量构造学习方法研究进展[J].控制与决策,2023,38(8):2231-2242. 被引量：2
4苑红星,卓雪雪,竺德,刘辉.基于矩阵的混合型邻域决策粗糙集增量式更新算法[J].控制与决策,2022,37(6):1621-1631. 被引量：8

二级参考文献29

1杨臻,邱保志.混合信息系统的动态变精度粗糙集模型[J].控制与决策,2020,35(2):297-308. 被引量：10
2王桐,刘大昕.一种新的混合XML文档聚类方法[J].哈尔滨工程大学学报,2007,28(6):697-701. 被引量：7
3Omidvar, Amin, Mehdi Garakani, et al. Context baseduser ranking in forums for expert finding using Word Netdictionary and social network analysis[J]. InformationTechnology and Management, 2014, 15(1): 51-63.
4A¨?telhadj A, Boughanem M, Mezghiche M. Usingstructural similarity for clustering XML documents[J].Knowledge and Information Systems, 2012, 32(1): 109-139.
5Helmer S, Augsten N, B ¨ohlen M. Measuring structuralsimilarity of semi-structured data based on informationtheoretic approaches[J]. The VLDB J, 2012, 21(5): 677-702.
6Guo Yongming, Chen Dehua, Le Jiagin. Clustering XMLdocuments by combining content and structure[C]. IntSymposium on Information Science and Engineering.Shanghai: IEEE Computer Society, 2008: 583-587.
7Tran Tien, Nayak Richi. A progressive clustering algorithmto group the XML data by structural and semanticsimilarity[J]. Int J of Pattern Recognition and ArtificialIntelligence, 2007, 21(4): 1-23.
8Madani Amina, Omar Boussaid, Djamel Eddine Zegour.Semi-structured documents mining: A review andcomparison[J]. Procedia Computer Science, 2013,22(2013): 330-339.
9Yoon J, Raghavan V, Kerschberg L. Bitcube: Clusteringand statistical analysis for xml documents[C]. The 13th IntConf on Scientific and Statistical Database Management.Virginia: Fairfax, 2001: 158-167.
10Nadine, Salah Bourennane. Dimensionality reductionbased on tensor modelling for classification methods[J].IEEE Trans on Geoscience and Remote Sensing, 2009,47(4): 1123-1131.

共引文献9

1解书凯,赵红军,李莉娟.基于能耗优化的AVS编解码自适应流媒体系统设计[J].实验室研究与探索,2018,37(8):57-60.
2吴正江,张亚宁,张真,梅秋雨,杨天.拟单层覆盖粗糙集中近似集的增量更新算法[J].计算机工程,2022,48(6):200-206. 被引量：1
3冀俊忠,龙腾,杨翠翠.基于邻域决策粗糙集的脑功能连接生物标记物识别[J].控制与决策,2023,38(4):1092-1100. 被引量：1
4刘扬,王佳祯,汪晓东,张威,李鑫.基于宽度学习系统的短期电力负荷预测方法研究[J].电力大数据,2023,26(7):23-31. 被引量：1
5薛亚龙,刘梓泞,王质淳.基于粗糙集理论的数据情报侦查决策研究[J].贵州警察学院学报,2024,36(2):86-95.
6陈占伟,胡晓.基于MEC边缘云的智慧商城数据更新控制算法[J].计算机仿真,2024,41(2):477-481.
7王尧,邵晶晶,宋云奎.基于决策粗糙集模型的电网多源异构数据整合[J].电子设计工程,2024,32(10):140-144.
8曾华鑫,吴伟志.不协调广义多尺度决策系统的最优尺度组合的层次关系[J].控制与决策,2024,39(6):2041-2050.
9杨金龙,董绍江,牟小燕.基于随机LSTM块映射特征提取的旋转机械故障诊断方法[J].陕西科技大学学报,2024,42(4):142-153.

1高顺意,赵杰.超分子主客体作用调控电化学熵变[J].科学通报,2024,69(10):1251-1252.
2陆森良,冯宝,徐坤财,陈业航,陈相猛.自适应迁移鲁棒特征的个性化联邦医学图像分类[J].中国图象图形学报,2024,29(3):798-810.
3张义定,雷锦志.异质性干细胞增殖过程中的熵变化[J].生物信息学,2024,22(1):58-69.
4胡斯乐,董立国,白晓雄,许浩,安钰,万海霞,韩新生,王月玲,李妍,郭永忠,余旋.黄土丘陵区典型土地利用类型土壤-微生物量及其生态化学计量特征[J].水土保持学报,2024,38(3):298-305.
5王东炜,刘柏辰,韩志,王艳美,唐延东.基于低秩分解和向量量化的深度网络压缩方法[J].计算机应用,2024,44(7):1987-1994.
6SangSeok Lee,HaeWon Moon,Lee Sael.Block Incremental Dense Tucker Decomposition with Application to Spatial and Temporal Analysis of Air Quality Data[J].Computer Modeling in Engineering & Sciences,2024,139(4):319-336.
7陈孝国,肖修鸿,苏锦棱,丁一鸣,陈楚楚.基于TOPSIS法的直觉模糊熵几何构造方法[J].湖州师范学院学报,2024,46(4):15-22.
8孟灿,翁巍,庞泰,赵蕾,华全斌.基于改进残差网络的企业融资信息网络异常流量入侵检测方法[J].网络安全技术与应用,2024(7):41-43. 被引量：1
9林冲,闫文君,纪纲,于斌,王莹.深度神经网络参数轻量化方法综述[J].中国电子科学研究院学报,2024,19(4):350-363.
10杨鹏,刘亮,张磊,刘林,李子强,贾凯.一种基于强化学习的软件安全实体关系预测方法[J].四川大学学报（自然科学版）,2024,61(4):163-171.

控制与决策

2024年第7期

浏览历史

内容加载中请稍等...

一种基于模特征的增量式张量Tucker分解方法

参考文献4

二级参考文献29

共引文献9

相关作者

相关机构

相关主题

浏览历史