摘要
针对传统网络流量分类方法要求训练和测试数据分布一致、训练数据充足的假设在实际中难以满足的问题,引入迁移学习理论对其研究,提出从特征属性和样本域内相似性两个角度对网络流量数据优化,改进TrAdaBoost(boosting for transfer learning)算法的权重更新机制使其适应多分类任务,添加抑制因子解决源域权重转移现象。在数据集Moore上对该方法检验,结果表明,当目标样本量不足,该模型能根据相似性迁移,有效避免负迁移问题,相比其它流量分类模型具有更高的分类准确率。
Aiming at the problem that the assumption that the traditional network traffic classification methods require uniform distribution of training and test data with sufficient training data is difficult to be met in practice, the transfer learning theory was introduced. The network traffic data were optimized from the perspectives of feature and similarity. The multi classification task was adapted by adjusting the weight update mechanism of the TrAdaBoost(boosting for transfer learning) algorithm. The source domain weight transfer phenomenon was solved by the inhibitory factors. The method was tested on the Moore. The results show that when the target sample is insufficient, the model can transfer according to similarity, and the negative transfer can be effectively avoided. Compared with other traffic classification models, the model has higher classification accuracy.
作者
程超
郭晨璐
CHENG Chao;GUO Chen-lu(College of Computer Science and Engineering,Changchun University of Technology,Changchun 130000,China)
出处
《计算机工程与设计》
北大核心
2023年第2期349-355,共7页
Computer Engineering and Design
基金
吉林省教育厅基金项目(JJKH20210754KJ)
吉林省科学技术厅基金项目(20200401127GX)。
关键词
相似性
迁移学习
网络流量
数据充足
负迁移
权重衰减
多分类
similarity
transfer learning
network flow
sufficient data
negative transfer
weight attenuation
multi classification