系统仿真学报 ›› 2021, Vol. 33 ›› Issue (3): 679-689.doi: 10.16182/j.issn1004731x.joss.19-0573

• 仿真支撑平台/系统技术 • 上一篇    下一篇

面向不平衡数据的网络流量异常检测方法

董书琴1,2, 张斌1,2   

  1. 1.战略支援部队信息工程大学,河南 郑州 450001;
    2.河南省信息安全重点实验室,河南 郑州 450001
  • 收稿日期:2019-11-01 修回日期:2020-01-17 出版日期:2021-03-18 发布日期:2021-03-18
  • 作者简介:董书琴(1990-),男,博士,讲师,研究方向为网络安全态势感知。E-mail:dongshuqin377@126.com
  • 基金资助:
    河南省基础与前沿技术研究计划(142300413201),信息工程大学新兴科研方向培育基金(2016604703),信息工程大学科研团队发展基金(2019f3303)

Network Traffic Anomaly Detection Method for Imbalanced Data

Dong Shuqin1,2, Zhang Bin1,2   

  1. 1. SSF Information Engineering University, Zhengzhou 450001, China;
    2. Henan Key Laboratory of Information Security, Zhengzhou 450001, China
  • Received:2019-11-01 Revised:2020-01-17 Online:2021-03-18 Published:2021-03-18

摘要: 针对小流量攻击样本稀少导致特征提取准确性低进而影响检测性能的问题,提出一种面向不平衡数据的网络流量异常检测方法。设计流量异常检测模型:变换堆叠降噪自编码器(Stacked Denoising Autoencoder,SDA)激活函数、结构、噪声比例及dropout率,学习不同特征空间流量特征,解决单一空间小流量攻击特征提取准确性低的问题;设计批标准化算法,采用Adam算法训练SDA参数,提取多样性流量特征;联合所提特征对Softmax进行训练,提高小流量攻击检测精度。实验结果表明:相比随机森林、单SDA和现有特征融合方法,所提方法分类准确率和小流量攻击检测率较高,且检测性能稳定。

关键词: 异常检测, 不平衡流量分类, 深度学习, 堆叠降噪自编码器

Abstract: Aiming at the poor detection performances caused by the low feature extraction accuracy of rare traffic attacks from scarce samples, a network traffic anomaly detection method for imbalanced data is proposed. A traffic anomaly detection model is designed, in which the traffic features in different feature spaces are learned by alternating activation functions, architectures, corrupted rates and dropout rates of stacked denoising autoencoder (SDA), and the low accuracy in extracting features of rare traffic attacks in a single space is solved. A batch normalization algorithm is designed, and the Adam algorithm is adopted to train parameters of SDAs to extract multifarious traffic features. The Softmax classifier is trained by combining the extracted features, so that the rare traffic attacks can be detected with a high detection precision. The experimental results show that, compared with the methods based on random forest, single SDA and feature fusion, the proposed method has higher classification accuracy, higher detection rate of rare traffic attacks and the detection performances are stable.

Key words: anomaly detection, imbalanced traffic classification, deep learning, stacked denoising autoencoder

中图分类号: