摘要
对分布式网络数据异常结构的优化识别,能够有效解决计算机网络安全问题。对数据的异常数据进行识别,需要结合间隙统计方法,度量参考数据集与观测数据集之间的间隙聚类,完成数据异常结构的优化识别。传统方法得到数据流量中的流量关键点,将数据转换映射到相应关键点后建立直方图,但忽略了对数据集间隙聚类的度量,导致识别精度偏低。提出基于余弦聚类的识别方法。计算每个数据流特征的特征熵和特征比,选取异常结构识别的特征属性向量,根据余弦聚类的方法计算每个数据流特征样本到每个聚类中心的余弦距离,直到获得的聚类中心保持不变或变化较小;最后利用间隙统计方法,度量参考数据集与观测数据集之间的间隙聚类,实现异常结构的识别。实验结果证明,所提方法能够实现分布式网络数据异常结构的准确识别,且能够较好地满足实时识别的需要。
This article proposes a recognition method based on cosine clustering. The characteristic entropy and characteristic ratio of each data flow are calculated, and then the feature attribute vector of abnormal structure recog- nition is selected. After that the cosine distance from feature sample of each data flow to each clustering center is cal- culated based on cosine clustering until clustering center remained constant. Finally, the gap statistical method is used to measure the gap clustering between reference data set and observation data set. Thus, the abnormal structure is identified. Simulation results verify that the proposed method can realize the accurate recognition of anomaly structure of distributed network data, which can satisfy the real-time recognition.
作者
王新强
邓蓓
金诗博
WANG Xin-qiang, DENG Bei, JIN Shi-bo(Tianjin Sino-German University of Applied Sciences, Tianjin 300350, Chin)
出处
《计算机仿真》
北大核心
2018年第5期383-386,共4页
Computer Simulation
关键词
分布式网络
数据
异常结构
优化识别
Distributed network
Data
Abnormal structure
Optimization recognition